ajottiell Imucraits ffitbrarg 

atliata. Sew ?"* 

BOUGHT WITH THE INCOME OF THE 

SAGE ENDOWMENT FUND 

THE GIFT OF 

HENRY W. SAGE 

1891 



Cornell University Library 
QP 355.B88 1921 



Essentials of mental measurement, 




1924 024 562 211 




Cornell University 
Library 



The original of this book is in 
the Cornell University Library. 

There are no known copyright restrictions in 
the United States on the use of the text. 



http://www.archive.org/details/cu31924024562211 



The Cambridge Psychological Library 



THE ESSENTIALS OF 
MENTAL MEASUREMENT 



CAMBRIDGE UNIVERSITY PRESS 

C. F. CLAY, Manager 

LONDON : FETTER LANE, B.C. 4 




LONDON : H. K. LEWIS AND CO., Ltd. 

136, Gower Street, W.C. 1 
NEW YORK :'thE MACMILLAN CO. 
BOMBAY 1 

CALCUTTA \ MACMILLAN AND CO., Ltd. 
MADRAS J 
TORONTO : THE MACMILLAN CO. OF 

CANADA, Ltd. 
TOKYO : MARUZEN-KABUSHIKI-KAISHA 



ALL EIGHTS BESEBVED 



THE ESSENTIALS 

OF 

MENTAL MEASUEEMENT 



BY 

WILLIAM BROWN 

M.A., M.D. (OxoN.), D.Sc. (Lond.) 

READER IN PSYCHOLOGY IN 
THE UNIVERSITY OF LONDON (KING's COLLEGE) 

AND 

GODFREY H. THOMSON 

D.Sc. (DuNELM.), Ph.D. (Steasburg) 

PROFESSOR OF EDUCATION, ARMSTRONG COLLEGE, 

NEWCASTLE-UPON-TYNE 

LATE PEMBERTON FELLOW OP THE UNIVERSITY OF DURHAM 



CAMBRIDGE 

AT THE UNIVERSITY PRESS 

1921 



PREFACE 



Numerous changes, in the form of additions, omissions, and altera- 
tions, have been made in this edition. Chapters I, V and VI remain 
with little alteration. Chapters III and VIII have been expanded and 
altered, while Chapters II, IV, VII, IX and X are entirely or almost 
entirely new. These changes are wholly the work of Professor Godfrey 
H. Thomson, and the merit of the very great improvement in the book 
is due to him. Owing to my own time during the War being taken up 
entirely with army medical work, I have been unable to take any part 
in the further development of correlational psychology since 1914, and 
it was therefore a very great rehef to me when Professor Thomson 
kindly ofEered.to cooperate in bringing out a second edition of Mental 
Measurement. No one could have been more fitted for this task, both 
on 'the mathematical and on the psychological side, and he has produced 
what is in half its extent, a new book, vastly superior to the old. Some 
of the sections are purely mathematical, and these will perhaps appeal 
to a wider circle of readers. 

As this second edition, like the first, is opposed to the theories of 
Professor C. Spearman, I wish to take the present opportunity of saying 
that as an opponent I have learnt to respect and admire his" work to 
a very great degree, and looking back dispassionately over six years 
spent in other forms of psychological research, I find myself more con- 
vinced than ever that his work in correlational psychology is epoch- 
making in its significance. Although I have found no occasion to 
withdraw any of my own earlier criticisms, I have gladly taken the 
opportunity of modifjang the wording in many respects, in order to do 
better justice to the outstanding importance of his views. Whatever 
may be the ultimate verdict on the significance of "hierarchical order" 
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in groups of correlation coefficients. Professor Spearman's correlational 
work in other directions would alone justify his unique position in the 
domain of statistical psychology. 

It has seemed to me to be essential that Professor Thomson should 
contribute a separate preface, since so much of this edition is concerned 
with the vindication of a theory which originated from his brain alone. 
Should a third edition be called for later on, I hope to be able to add 
a "Part III," summarising results and theories in non-mathematical 
language. But at present the battle rages so hotly that this is not 
desirable, even if it were possible. 

W. B. 
King's College, 
London. 

August, 1920. 



II 

Dr Brown's suggestion that we should write separate prefaces to 
this book appeals to me for one reason only, that it enables me to 
shoulder, myself, the responsibility for any errors either in principles 
or in detail which this edition may contain. Any credit I am glad to 
share, but the faults are my own. 

The book is necessarily concerned largely with controversial matters. 
On one of these, the question of the validity of the reasoning which 
Professor Spearman has based on the occurrence of hierarchical order, 
I am entirely convinced that the situation is as described in Chapters IX 
and X. My position is that hierarchical order is the natural order among 
correlation coefficients, that it only expresses the well-known fact that 
correlation coefficients are themselves correlated, and that the degree 
of perfection of hierarchical order found among psychological correlation 
coefficients is merely that which occurs by chance, and not, as Professor 
Spearman has been led to believe, extraordinarily high. 

On this point, then, I feel certain that Professor Spearman has drawn 
over-hasty conclusions, conclusions which may be, by a fine instinct, 
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the true conclusions, but conclusions which are totally unsupported by 
the "hierarchy" argument. 

On a second point at issue, however, namely the question of the use 
of ranks instead of measurements in calculating correlations, I feel much 
less competent to speak. There is a good deal to be said on both sides. 
The crux of the question to my mind is this. Admittedly the form of 
distribution of any set of scores in a mental test is not necessarily 
identical with the distribution of the true scores in mental units. If 
we give up the actual scores and merely retain the order of merit we 
are, it would seem, only giving up a false claim to accuracy and re- 
stricting ourselves to what we are certain of. But when we come to 
perform the actual calculations we find that we are again assuming a 
form of distribution of the mental quantity, usually a normal distri- 
bution, but sometimes some other, as in the "Foot-rule." It comes 
therefore to this, that in calculating rank-correlations we are refusing 
to take the distribution found and are substituting for it one not found. 
The question then is whether it is more probable that the assumed or 
the measured distribution corresponds to the actual unknown mental 
distribution. I think that each case would have to be taken on its 
merits: but I do feel that in the actual measurement the attempt 
should be made to approximate to the true distribution, by comparing 
differences carefully, as I suggest on pp. 11 — 12. 

However, I wish to make it clear that there is not, on this question 
of the use of ranks, that sharp cleavage of opinion between Professor 
Spearman and myself which cannot be disguised in the case of the 
General Factor argument. 

A third question is that of the correction of raw correlation coef- 
ficients, in which I share to a considerable extent Dr Brown's misgivings 
as to Professor Spearman's formula, and on which Dr Brown has, since 
the first edition of this book, added experimental evidence as to that 
correlation of errors and true values which the Spearman formula 
assumes to be zero (see pp. 160 — 3). 

My thanks are due to the President and Council of the Royal Society, 
and to the Editors of the British Journal of Psychology, the American 
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Journal of Psychology, the Psychological Review, and Biometrika, for 
their kind permission to reproduce long extracts from articles in the 
publications controlled by them: to Professor F. M. Urban and his 
publisher Mr W. Engelmann for permission to print Urban's Tables for 
the Constant Method : to Dr E. G. Boring of Clark University for valuable 
suggestions: to Dr G. J. Rich for his permission to print his checking 
table : to my colleague Dr G. R. Goldsbrough for reading some mathe- 
matical pages of the proofs : and to Professor Spearman for reading the 
proofs from p. 65 onwards, and the MS. of this preface. Acknowledg- 
ments to authors cited are made by footnotes, but should any omission 
in this or any other respect have occurred, my sincere apologies are 
tendered for the mistake. 

I have welcomed the opportunity which this book has afEorded of 
printing in one volume arguments which, after a delay due to four 
years' military service, I was compelled to publish piecemeal, during 
1919 and the earlier months of the present year, in various scattered 
papers: and I would like finally to thank Dr Brown himself for per- 
mitting his book to be thus mauled by another and will close as I began 
by begging that none of my crimes may be visited on his head. 



G. H. T. 



Abmstbong College, 

Newcastle-upon-Tyne. 
August, 1920. 
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PAETI 
PSYCHOPHYSICS 

CHAPTER I 

MENTAL MEASUREMENT 

Equal appearing intervals — Just perceptible distances — The interpretation of 
Weber's Law — Indirect methods of measmement — ^The approach to measurement 
by means of grading magnitudes and their differences. 

(1) EQUAL APPEARING INTERVALS 

The pre-conditions* of measurement in any sphere of experience are 
(1) the homogeneity of the phenomena, or of any particular aspect of it, 
to be measured, (2) the possibility of fixing a unit in terms of which the 
measurement may be made, and of which the total magnitude may be 
regarded as a mere multiple or sub-multiple. These pre-requisites are 
satisfied in the cases of spatial and temporal magnitudes, in terms of 
which, directly or indirectly, all the measurements of the physical 
sciences are expressed. It was thought by Fechner that they are also 
satisfied in the case of the strictly psychical phenomena of sensation- 
intensity, i.e. it was assumed that any given sensation-intensity might 
be regarded as made up of a sum of unit sensation-intensities. This 
view has been definitely rejected by later psychologists. Every sensation- 
intensity is qualitatively distinct from every other sensation-intensity. 
No addition or subtraction of intensities is possible. "To introspection, 
our feeling of pink is surely not a portion of our feeUng of scarlet; nor 
does the light of an electric arc seem to contain that of a tallow-candle 
in itself" (James)f. Fechner's mistake was due to a confusion of 

* These pre-conditions are those usually stated. But the idea of measurement has 
been so expanded during recent generations by the mathematical ideas of continuity, 
infinity and limit that they are becoming inadequate as a statement of the position. 
Compare the last section of the present chapter, on the approach to measurement by 
means of grading magnitudes and their differences. 

t Principles of Psychology, i. p. 546. 
B. & T. 1 
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sensation-intensities with the (physical) stimulus-values required to 
produce them. 

Nevertheless, purely psychical measurement is not entirely im- 
possible. Within any one series of sensation-intensities, e.g. a series of 
greys, the contrasts or " distances " separating different pairs of intensities 
are perfectly homogeneous with one another and can be measured in 
terms of one another or in terms of an arbitrarily chosen unit of " sense- 
distance." Given two brightness-intensities a and h, it is quite possible 
to find, within limits of error, a brightness-intensity c which is as much 
higher than 6 in the scale of intensities as h is than a, i.e. such that the 
sense-distance 6c = the sense-distance ah; or, again, it is quite possible, 
theoretically, to find a brightness-intensity d, which bisects the sense- 
distance ah, i.e. which is such that it is as far removed from a in the 
scale of intensities as 6 is from it — in symbols, ad = db. Hence the 
"distance," or disparity, of h from a is twice that of d from a, the distance 
of c from a is four times that of d from a. If, now, ad, or the distance 
of d from a, be taken as a conventional unit, the values of ah and ac 
will be 2 and 4 respectively*. 



— >- a 

Scale of 
brightness-intensities 



rig. 1 



A scale of intensities may in this way be formed rising by "equal- 
kppearing intervals " or sense-distances, and the magnitude of any given 
interval may then, theoretically, be read off in terms of the unit-distance 
employed in the construction of the scale. In practice, however, it is 
found more convenient to fix the successive scale-marks, the successive 
members of the intensity-series, in terms of their corresponding stimulus- 
values. It has been found by experiment that in the case of light- 
intensities and sound-intensities the successive stimulus-values form, 
with fair approximatipn, a geometrical progression, or, in other words, 
each stimulus-value divided by the immediately preceding one gives 
approximately the same quotient. From the stimulus-values corre- 
sponding to an ascending series of eight equidistant brightness-values, 
Ebbinghaus obtained the following series of quotients: 

2-3 2-1 2-1 1-8 1-7 1-7 2-0 

* This view of mental measurement in terms of sense-distance first originated with 
J. R. L. Delboeuf, Revue PhilosopMque, 1878, v. p. 53. His term for sense-distance was 
"contraate sensible. 
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The quotient value is not entirely constant, being slightly greater 
towards the two ends of the scale than it is at or about the middle. For 
this central region, then, the general relation of the sense-distance to 
the stimulus- values is given by the logarithmic formula : 

„ ,. , T 7 , stimulus at b 

Bense-distance ab = k log — : , ■ — , 

stimulus at a 

where the stimulus at a is one which gives any finite intensity of sensa- 
tion taken as the starting point or conventional zero (N.B. it is not 
necessarily liminal); the stimuli at a and b are those which correspond 
to the sensation-intensities of which ab is the "contraste sensible" or 
sense-difference. 

(2) JUST PERCEPTIBLE DISTANCES 

A mode of procedure which not only admits of much wider practical 
application than the above-mentioned "method of mean gradations," 
but also possesses a peculiar historical importance, is that which is 
concerned with the determination of the stimulus-increments corre- 
sponding to just-noticeable increments of sensation-intensity in different 
parts of the intensity scale. Weber found, in a series of experiments 
chiefly with lifted weights, that this stimulus-increment was relatively, 
not absolutely, constant for different regions of the intensity scale, i.e. 
that the stimulus corresponding to any original sensation-intensity had 
always to be increased by a constant proportion to arouse a just-noticeable 
increment of the sensation-intensity. If a 103 grams weight is just 
noticeably heavier than a 100 grams weight, then the weight just 
noticeably heavier than a 200 grams weight will be a 206 grams weight, 
not 203 grams. 

Mathematically formulated, Weber's Law is : 

S (stimulus) ^ ^ 

— V: = — = a constant. 

stimulus 

The quantity S (stimulus)/stimulus — j-f^ or -03 in our example of weight- 
Ufting — ^is known as the " relative difference hmen." It is of course the 
average value of a considerable number of determinations. 

Fechner verified Weber's Law in many different realms of sensation- 
intensity, and made it the basis of his own system of mental measure- 
ment. This he did by making the following three assumptions: 

(1) that a sensation-intensity is a measurable magnitude and may 
therefore be regarded as a sum of unit-intensities ; 

(2) that just-noticeable differences of sensation-intensity are equal 

1—2 
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at different parts of the stimulus scale, and may therefore conveniently 
serve as the unit-intensities above-mentioned; 

(3) that the just-noticeable difference of sensation may be treated 
as a difference of two sensations, or at least that if "Weber's Law apphes 
to the former ("sensed difference") it will also apply to the latter 
( ' ' difference sensation " ) . 

On the basis of Weber's Law and these added assumptions, Fechner 
obtains the following formula, viz. 

, , ^. , d (stimulus) 

a (sensation) = c — ^-^ -, , 

stimulus 

which he calls th.e fundamental formula for menial measurement. Inte- 
grating, this becomes 

sensation = c log^ stimulus + G. 

Putting the stimulus in this equation equal to the stimulus T for which 
the sensation is just below the threshold of consciousness, i.e. = 0, we 

^^^' = clog,T + C. 

Subtracting the second from the first equation, 

, stimulus 
sensation = c log^ ■= . 

Putting T = 1, and transferring to the ordinary logarithm system, we get 
sensation = k log stimulus. 

All the assumptions involved are questionable. The first one has 
already been considered at some length. It is not the single sensation- 
intensity which is measurable, but the distinctness, disparity or distance 
of one sensation-intensity from another. 

We must therefore regard the just-noticeable difference, not as a 
difference of two sensation-intensities but as a minimal sense-distance, 
if we are to be able to make use of it in our scheme of mental measure- 
ment. This modification, however, still leaves us involved in the diffi- 
culties of assumptions (2) and (3). 

Fechner's own reason for regarding all just-noticeable differences 
belonging to any one scale of intensities as equal was that they appear 
equal to introspection. Introspection in a case like this is obviously 
difficult, even for the most skilful observers, and its verdict cannot, 
therefore, be greatly relied upon. Theoretically it is quite conceivable 
that just-noticeable differences, though equivalent to one another as 
being all just-noticeable, i.e. as being sense-distances so small that the 
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slightest diminution of them would cause them all, equally, to cease to 
be noticeable, yet as noticed or perceived would appear of different 
magnitude one from another. Ebbinghaus points to the analogous case 
of differentials or infinitesimals in mathematics. These are all equivalent 
to one another as being all equally negUgible as compared with finite 
magnitudes, yet are by no means necessarily equal to one another. If 
they belong to different " orders," those of a higher order are negligible 
as compared with those of a lower, etc. Again, "the least distances 
perceived as such at different parts of the skin or in direct and indirect 
vision do not by any means all appear as equal magnitudes. On the 
contrary, so soon as they come to consciousness as distances they are 
at once perceived as distances of varying size, in a certain approximation 
to their objective differences*." In spite of these considerations, 
Ebbinghaus regards the correspondence of the stimulus results obtained 
for equal appearing intervals and for just perceptible intervals in the 
case of brightness-intensities (in the middle region of the scale both 
series of stimuli form a geometrical progression) as sufficient evidence 
for the approximate equahty of the latter intervals. Miiller and Wundt 
had previously advanced the same argument. 

Several experimental investigations have been made with the express 
purpose of testing the relation of the methods of minimal change and 
mean gradations. Titchenerf sums up the theoretical basis of such 
experiments concisely as follows: "There are in reahty two possible 
ways of working. (1) We might take a series of stimulus- values, corre- 
sponding to a series, say, of eight successive just-noticeable differences of 
sensation, and thereafter directly compare the two half-distances, of four 
just-noticeable differences each, and decide upon their equahty or in- 
equality. Thiswould be a direct method of experiment." " Or (2) we might 
determine a few just-noticeable differences of sensation at different parts 
of the stimulus scale, in order to estabhsh the constancy of the relative 
difference Umen, and thereafter work with suprahminal differences, and 
decide whether the same constancy holds. This would be an indirect 
method; it is the method indicated by the authors just cited [Miiller, 
Wundt, Kohler and Tannery].... Either of these two methods would, 
presumably, take us to our goal. The experimental work would be 
exceedingly difficult. Liminal determinations are always and intrinsi- 
cally difficult; and, further, the judgments passed upon just-noticeable 
differences and upon suprahminal differences are, even under the most 

* Ebbinghaus, Orundzuge der Psychologie, 2nd ed., 1905, p. 524. 
f "Experimental Psychology," n. Instructor's Manual, p. Ixxvii. 
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favourable conditions, the expressions of radically different mental 
attitudes." 

It should be mentioned here that the principal rival to Fechner's 
hypothesis — the "difference hypothesis" — is that first formulated by 
Plateau, and generally known as the "quotient hypothesis." Plateau 
adopted the psychophysical formula 

sensation = c (stimulus)* 
on the basis of experiments by the method of mean gradations. This 
implies, in the place of Fechner's fundamental formula, the formula 
8 (sensation) _ -.^ 8 (stimulus) 
sensation , stimulus ' 

in other words, it assumes that just-noticeable differences are relatively, 
not absolutely, equal sensation-magnitudes. Although Plateau himself 
withdrew his formula later, the "quotient hypothesis" still remains as 
the rival of Fechner's "difference hypothesis." 

To return to the experiments. Merkel (1888) worked with bright- 
nesses, pressures, and noises, and found that the stimulus corresponding 
to the sensation bisecting a supraliminal sense-distance was the arith- 
metical mean of the stimuli corresponding to the two terminal sensations, 
which would seem to support the quotient hypothesis, though such an 
inference is not entirely free from objection. Angell (1892) worked with 
noise-intensities by the method of mean gradations, avoided certain 
sources of error present in Merkel's form of procedure, and obtained 
results supporting the difference hypothesis, the stimulus of the bisecting 
sensation being the geometrical, not the arithmetical, mean of the 
terminal stimuli. 

A more thorough investigation was carried out by W. Ament* in 
1900. He used a series of Marbe greys for the brightnesses and employed 
the direct method; for the noise-intensities he used a Fechner sound 
pendulum and worked principally by the indirect method. The result 
reached supported the quotient hypothesis. But Ament's work has not 
escaped criticism. A repetition of his experiments on brightness- 
intensities by Frobesf has failed to confirm his results. Ebbinghaus 
also has found, in careful experiments with rotating sectors, that just- 
noticeable differences in different parts of the scale of brightnesses are 
equal to one another. On the whole, therefore, the balance of evidence 
seems to be in favour of the "difference hypothesis." 

* W. Ament, "Ueber das Verhaltnis der ebenmerklichen zu den uebermerklichen 
UntereoMeden bei Lioht- und Sohall-intensitaten," Phil. Studien, 1900, xvi. pp. 135 ff. 
f Zeitschrift far Psychologie, xxxvi. p. 344. 
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(3) THE INTERPRETATION OF WEBER'S LAW 

Fechner's third assumption — see above, p. 4 — brings us to the 
question of the interpretation of Weber's Law. 
There are three general forms of interpretation : 

(1) the psychophysical (Fechner), 

(2) the physiological (Miiller, Ebbinghaus, James, etc.), 

(3) the psychological (Wundt). 

According to (1), the logarithmic transition takes place in passing 
from the physiological changes in the sensory centres of the cerebral 
cortex to the corresponding sensation-intensities. The chief objection 
to this view of Fechner's is that Weber's Law is not exact. This same 
consideration supports (2), or the physiological view, according to which 
the transition occurs either at the inception of the stimulus in the sense 
organ, or somewhere between the nerve endings and their central con- 
nections in the sensory areas of the cortex. Experimental results 
obtained by Waller* and Steinachf are in favour of this view. Waller 
stimulated a frog's eye with light of different intensities, and found that 
the corresponding "negative variations" set up in the optic nerve varied 
in intensity as the logarithm of the stimulus-values (approx.). Steinach 
obtained similar results on stimulating the skin of a frog's thigh with 
weights and noting the negative variation in the attached nerve. If the 
negative variation may be assumed to be proportional to the intensity 
of the nerve-current passing along the nerve, these results point to the 
conclusion that the logarithmic transition occurs in the sense organ and 
its sensory nerve endings. 

Ebbinghausf has constructed a theory based upon the conception 
of varying degrees of dissociability of complex molecules to account for 
the law and also for the deviations from it towards the two extremes of 
the intensity scale. 

The psychological view, (3), of the law, held by Wundt, regards it 
as a special case of the general psychological "law of relativity." 
Stimulus, physiological process, and pure sensation-intensity increase in 
simple proportion to one another. The logarithmic transition occurs in 
passing from mere sensation and sensation difference to apperceived 
sensation and apperceived sensation difference, and the intensities are 

* Waller, "Points relating to the Weber-Fecbner Law," Brain, 1895, xvm. p. 200. 

t Steinach, " Elektromotorische Erscheinungen an Hautsinnesnerven bei adaquater 
Reizung," Pfliiger's Arch. 1896, Bd. 63, S. 495. 

t Ebbinghaus, "Ueber den Grund der Abweichungen von dem Weber'schen Gesetz 
bei Liohtempfindungen," Pfliiger's Arch. 1889, Bd. 54, S. 113. 
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apperceived always in relation to one another. In addition to the 
objection that it regards the sensation-intensities themselves, not their 
distances from one another, as measurable magnitudes, this view is also 
open to the criticism that it furnishes no explanation of the widely- 
varying size of the relative difference limen in different sense-depart- 
ments (e.g. DL for brightness-intensities = 3-^, DL for sound-intensities 
= about I) ; the physiological view sees in the varying structure of the 
different sense organs the adequate explanation of this. Moreover, the 
psychological view has no completely satisfactory explanation to give 
of the deviations from Weber's Law so frequently met with. 

These objections and difi&culties make the view improbable, but by 
no means prove it to be impossible. It has the great merit of emphasising 
more definitely than was heretofore the case the importance of the more 
purely psychological factors in psychophysical experiments — ^in par- 
ticular, it brings into prominence the distinction between mere disparity 
of sensation-intensities present simultaneously or in immediate succession 
in the same consciousness, and the perception of this disparity, the 
discrimination of the intensities one from another. In psycho-physical 
experiments the subject's consciousness is not limited to the mere 
sensational level. 

In this connection the distinction, explained in Chapter III of the 
present book, on page 75, between two essentially different measures 
of a subject's fineness of discrimination, is not without importance, for 
those two measures, as far as experimental work goes, are both believed 
to obey Weber's Law. The one is the difference threshold spoken of in 
the present discussion, the other is there defined and named the inter- 
quartile range of the point of subjective equaUty : and the two quantities 
appear to differ in the level, sensational or perceptual, at which they 
stand. The extended idea of the probability of a judgment there defined 
throws this into prominence. 

Another fact which would seem to have a very immediate bearing 
on the question of the interpretation of Weber's Law is that plants in 
their response to the stimulus of gravity (Geotropism) appear to obey 
that law"". 

Although with continuous increase of stimulus-intensities the corre- 
sponding sensation-intensities rise in steps, each representing a just- 
noticeable difference, the psychophysical relation is really a strictly 

* H. Fitting, Jahrbuch f. wise. Botanik, Bd. XLi. 1905; F. Darwin, New Phytologist, 
1906; James Small, AnnaU of Botany, xxxi. April 1917; James Small, Proc. Roy. Soc. B, 
xo. 1918. 



OH. I] MENTAL MEASUREMENT 9 

continuous one, as becomes at once obvious if we consider a special case. 
The sensation-intensity aroused in lifting a weight of 100 grams is 
"indistinguishable," as we say, from that aroused by 102 grams, the 
sensation aroused by 102 grams is "indistinguishable" from that aroused 
by 104 grams; yet the sensation aroused by 100 grams is perceptibly 
different from that aroused by 104 grams. Thus the sensation-intensity 
increases continuously, and the reason why this is not immediately 
apparent is probably to be looked for in the physiological mechanism 
of the psychophysical organism. The fact is that statements like the 
above as to two sensations being "distinguishable" or not lack precision 
in the absence of a definition of what is to be meant by distinguishable. 
To introspection almost all sensations are distinguishable from one 
another inasmuch as we seldom will agree that two are identical. More- 
over, although a man will not (in the case of a subject of average sensi- 
tivity) give a majority of answers heavier in comparing 102 grams with 

100 grams, yet the number he does give will, if the experiment is 
sufficiently carefully performed, be greater than he will give with 

101 grams as the comparison weight (100 still being the standard). 
Although therefore he does not give, either with 101 or with 102 grams, 
a majority of answers heavier in comparing them with a standard of 
100 grams, yet he does distinguish them from one another (if we take 
the result of a number of experiments), giving more heavier answers 
with the heavier weight*. 

Delboeuf held that the limen has no psychological importance 
whatever. If this is an extreme view, the importance which Fechner 
attributed to the limen is equally extreme in the other direction. 

The absolute or stimulus limen is similar in kind to the difference 
limen, since consciousness is never empty of sensation-intensities when 
such a limen is being determined. Here again, Fechner's rigid distinction 
of the two was a fundamental error. 

(4) INDIRECT METHODS OP MEASUBEMENT 

The preceding account has probably sufficed to show that purely 
psychical measurement is a conceivable possibihty. Its practical appli- 
cation however has been more detailed than extensive. A more generally 
useful method in quantitative psychology is that which measures the 
external, physical or physiological, causes and effects of mental process. 
The measurements are made in terms of the physical units of space and 

* Compare Poinoare, Science and Hypothesis, Soott, 1905, p. 22; and G. H. Thomson, 
British Association Reports, 1913, paper under sub-section I. 
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time, yet they are not merely physical measurements, since they derive 
all their significance from the correlated psychical processes.- They are 
indirect psychical measurements*. Measurements of reaction-times, 
memory, fatigue, illusions, etc. are all of this nature. Their varieties 
are innumerable, and are illustrated by the accounts in any good text- 
book of experimental psychology (Sanford, Titchener, Myers). In all 
cases full introspective accounts are essential, and when correlated with 
the measurements make the latter essentially psychical measurements. 
Measurements of Umina, referred to in the previous section, are of the 
same nature. They are of some special importance as being measures of 
sensory acuity, etc. — aspects of the total mental abihty of psycho- 
physical organisms. They figure prominently in many researches based 
on the use of "mental tests." 

A method which makes a partial return to the more purely psychical 
form of measurement in terms of "distance" is the method of ranks or 
grades. Suppose we are considering the relative abihties of, say, 100 boys 
in Enghsh Composition. We should find it difficult to mark their essays 
individually in terms of any constant unit but might find it possible to 
arrange them in order of merit, especially if we had sufficient time at 
our disposal to employ the method of "paired comparisons." According 
to the procedure of this latter method, the essays would be taken in 
pairs, quite at random, and the better essay of each pair would be given 
a "preference mark." This procedure would be repeated again and again 
until every essay had been compated with every other essay. The order 
of merit is then given by the number of preferences attaching to each 
essay. In this order, however, we cannot assume that the "abihty- 
distance" from one boy to the next is a constant quantity. The boys 
near the extreme ends of the series will be farther removed from one 
another than the boys near the middle. We could only adjust for this 
if we knew the law of frequency-distribution for this kind of ability in 
this particular species of boy, and theoretically the determination of 
this distribution depends upon a prior fixing of the psychological unit, 
the unit of "ability-distance"; so that strictly the problem is insoluble. 
Since however under certain definite and indefinite conditions the form 
of distribution in a large number of biological and other cases of 
"physical" measurement has been found to be either Gaussian (normal) 
or differing from normal in ways described by Pearson's family of 
frequency curves, we might, with some probability of being near the 
truth, assume the normal form of distribution in the given case and so 
* See EbbinghauB, Orundzuge der Paychologie, 1905, pp. 75, 76. 
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obtain a quantitative measure for the ability of each particular boy*. 
A direct psychological determination and application of the (conven- 
tional) unit-distance is not, perhaps, an entirely impossible problem, 
and work in this direction may be expected and if achieved would 
certainly be much more scientific and psychological than the present 
method of measuring in terms of the external quantum of work done. 

Finally, the interrelations of different mental abilities within any 
well-defined group of individuals situated within any definite environ- 
ment may be determined by means of the technical method of " correla- 
tion." A correlation coefficient or other similar constant (e.g. correlation 
ratio) measures the tendsncy towards concomitant variation of two 
mental or other abilities within a group of individuals. The result may 
be transferred to any single individual within the group as measuring the 
degree of probabihty of connection of the two abilities in the particular 
case. The correlation between two abihties may be due to an actual direct 
relation of the abilities to one another, or, indirectly, to the influence of a 
common external environment upon them both. The first of these two 
cases is perhaps the more important, but the possibility of the second 
should not be lost sight of, and it also has a special interest of its own. 
The problems of correlation will be considered more fully in a later chapter. 

(5) THE APPROACH TO MEASUREMENT BY MEANS OP GRADING 
MAGNITUDES AND THEIR DIFFERENCES 

Since the publication of the first edition of this book an important 
symposium bearing on the question of mental measurement has been 
held (in 1913). The exact problem submitted to the joint meeting of 
the Mind Association, the Aristotehan Society, and the British Psycho- 
logical Society, was "Are the Intensity-differences of Sensation Quanti- 
tative?" Although many of the arguments of that discussionf are 
beyond the province of this book, it is of interest to note that there was 
a general consensus of opinion that sensation-intensities, and their 
differences, are at least "magnitudes" which can be graded, even if 
they be not "quantities" which can be measured. And following out 
further a suggestion contained in a quotation from Mr Bertrand Russell 
made by Professor Dawes Hicks in his contribution to the symposium, 
it may here be pointed out that grading leads, if the differences can 
also be graded, to something almost indistinguishable from, if indeed it 

* This was done, e.g. by Professor Pearson in his paper in Biometril.a, 1907, v. p. 105, 
and the example has been followed with success by other workers. 

t The papers are published in the British Joum. of Psychol. 1913, vi. pp. 137 — 189, 
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be not identical with, true measurement. For if we can arrange in order of 
magnitude a,b,c, ... and also their first differences a — b,b — c,c — d, ... 
and the differences of these differences in turn, and so on, we can space 
out the original quantities a, b, c, ... as accurately as though we used 
a unit and measured them. To take a simple example, suppose five 
quantities a, b, c, d, e have really the measures 10, 16, 20, 31, 32. If an 
observer, ignorant of these measures, only knows the order of grading 
a, b, c, d, e of the quantities, he has already made a considerable advance 
even although he does not know the spacing, or the distances apart. 
If however he further can grade the differences, that is, if he knows that 
the greatest difference is that between d and c, and that the others 
follow in the order 6 — a, c — 6, e — c?, he has advanced further towards 
measurement, in the sense of accurate spacing, although there are still 
many spacings that will satisfy these gradings. Thus far he can almost 
always go in mental phenomena. And although it is practically difficult, 
there does not seem any theoretical difficulty about taking the next 
step, and grading the differences of the second order. In our example 
this grading is {d - c) - {b - a) or a, say, 

(c - 6) - (e - d) or |8, 
(6 — a) — (c — b) or y: 
and the order of the third differences is 

(« - ^), (i8 - y). 
If now we could have all these gradings we could space out the original 
quantities very closely indeed to their true positions. This can be best 
seen by attempting to alter some one of the values while leaving all 
these gradings unaltered. Make d, for example, 29 instead of 31 and 
although the order a, b, c, d, e is unchanged, and also the order of the 
first differences, that of the second differences is completely altered. 

With an infinite number of quantities, and all the gradings of all 
their differences, we should, it would seem, arrive at an exact solution 
of the problem, so that grading and measurement are not perhaps so 
different in their nature as might at first be thought. 

Indeed a case could well be made out for the thesis that the 
theoretical objections sometimes brought against mental measurement 
really hold in the last resort against all measurement, and prove too 
much: and that the real difference between mental measurement and 
physical measurement is simply that mental phenomena, being practi- 
cally more difficult to handle, force on our notice the epistemological 
difficulties inherent in all measurement, whereas in physical measurement 
familiarity has bred contempt. 



CHAPTER II 

THE ELEMENTARY THEORY OP PROBABILITY 

Some statistical terms — ^Arithmetical short-cuts — ^Measures of scatter — ^The funda- 
mental theorem in probability — ^The binomial expansion — ^The normal curve of error 
— Fitting a normal curve to distribution data — ^The method of least squares. 

(1) SOME STATISTICAL TERMS 

The theory of probability was developed chiefly from two classes of 
material, (a) games of chance such as coin or dice throwing, roulette, 
etc., and (6) statistics, as they are called, that is such collections of 
quantitative information as the census, trade returns, insurance data and 
the like. It is easy to see that psychological experiments frequently 
resemble both of these classes. Any experiment, the result of which 
depends upon a human decision, has much in common with the throw 
of a die. In both cases it often seems mere chance what the result is, 
although we believe that in both cases this is due only to our ignorance 
of the numerous factors at work. And, since this is so, any scientific 
experiment on the actions and reactions of human beings must neces- 
sarily be repeated many times, until there accumulates a mass of 
quantitative information similar in many respects to a census return. 

The mass of quantitative information thus accumulated is found 
upon examination to have certain pecuharities or properties. For 
example consider the following case: — The experiments, carried out by 
Professor Urban in 1906 — 7, were on hfted weights. A standard weight 
of 100 grams was compared, by lifting it, with weights of 84, 88, 92, 
96, 100, 104 and 108 grams. The standard weight was always lifted 
first, and as the second unknown weight was hfted the judgment fighter, 
equal or heavier, was given*. 

Suppose the following answers were obtained on one occasion : 

108 grams, answer heavier 

104 „ „ equal 

100 „ „ heavier 

96 „ „ lighter 

92 „ „ equal 

88 „ „ lighter 

84 „ „ lighter 

* See "Die psyohophysischen Massmethoden ala Grundlagen empirisoher Messungen," 
by F. M. Urban, ArcUv fur die gesanUe Psydiologie, 1909, xv. p. 261. 
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In this series the lowest answer heavier is at 100 grams. Let this experi- 
ment be repeated 400 times, and in every series let the position of the 
lowest answer heavier be recorded. In a particular case the distribution 
of these just perceptibly heavier points was as follows: 



Grams 


... 84 


88 


92 


96 


100 


104 


108 


Frequency ... 


1 


8 


36 


85 


143 


119 


8 



These particulars are shown in graphic form in the adjoining figure 
where the points have been joined by straight lines to, make a polygon, 
which shows at once the chief peculiarities of such a collection of data. 



Frequency as just perceptibly heavier point 

s 1 i 








\/ 


\ 


\ 






/ 


1 








/ 


/ 






1 



84 88 '92 96 100 104 108 grams 
Fig. 2. A cocked hat curve (Urban's Subject III) 

namely that the points are not scattered anyhow, but occur most fre- 
quently at a central value from which the frequency falls off in both 
directions. Such a figure as this is colloquially termed a cocked hat. The 
point where the summit occurs, or point of greatest frequency, is called 
the mode. Here it is apparently at 100 grams, but if other weights 
between 100 and 104 grams could have been examined, it might be found 
to be elsewhere. The figure looks as though a smooth curve through the 
points would bring the mode somewhere a little higher than 100 grams. 
The middlemost of the 400 positions of the just perceptibly heavier 
point is called the median. It too is in the 100 gram group, though here 
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again its exact position is uncertain. A better known central value (and 

one which can be more exactly calculated) is the mean or average value, 

found thus : 

Ix 84= 84 

8x 88= 704 

36 X 92= 3312 

85 X 96= 8160 

143x100 = 14300 

119x104 = 12376 

8x108= 864 



Divide by 400 ) 39800 



99-5 grams 



In the above cases the only possible values of the sought for points 
were, by the nature of the experiments, at certain definite values. In 
another class of experiment, however, all values within the range are 
possible as in the following case. Twenty-nine experiments were made 
of bisecting a line by eye*. The lengths of the left-hand half of the line 
on these 29 occasions are given in this table, arranged in order of 
magnitude: 

63-8 mms. 

62-2 

621 

61-4 

61-3 

61-2 

61-2 upper Quaitile 



611 




61-0 




60-9 




60-8 




60-6 




60-4 




60-3 




600 


Median 


69-9 




69-9 




69-6 




59-5 




59-2 




69-2 




691 


lower Qaartile 



690 
58-8 
68-7 
58-6 
58-2 
68-1 
67-6 



29) 1743-7 

60-13 Mean. 
* AnlJ^experimentlperformed for the purpose of illustrating this chapter. Subject, 
G. H. Thomson. 
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Here the median by counting is 60 mms. and the mean on calculation is 
found to be 60-13 mms. In this case the "cocked hat" is not at first 
sight evident, but if the data are grouped in some way it comes to light. 
For example, they can be arranged thus : 

One leading from 63 to 63-9 mms. inclusive 

Two readings 

Sis „ 

Six „ 

Eight „ 

Five „ 

One reading 

Here the numbers 1, 2, 6, 6, 8, 5, 1 show clearly the concentration in 
the middle, although being smaller they are not so regular as in the 
previous "cocked hat." Instead of concentrating these at points it 
seems more accurate to construct a diagram such as that here given 



62 




62-9 




61 




61-9 




60 




60-9 




59 




59-9 




58 




58-9 




57 




57-9 





8 
7 


















- 






6 
5 
4 


















- 












3 


- 












2 
1 





































.57 58 69 60 61 62 63 64 

Millimetres 

Kg. 3. A histogram formed from the bisection data 



where a rectangle of the proper height is constructed over the corre- 
sponding base. Such a figure is called a "histogram." 

This example is convenient for explaining two new terms, namely 
the "quartiles" and "semi-interquartile range." The "quartiles" are 
the readings which are one-quarter distant from each end, just as the 
"median" is half-way. In the table on page 15 there are 29 readings. 



The half-way reading is 



29 -fl 



or 15 from either end, and the quartiles 



29 -fl 



are at the readings — ^ — or 7^ from each end, for which we can take 
the mean of the seventh and eighth readings at 59-05 for the lower 
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quartile and 61'15 for the upper quartile. The semi-interquartile range 
is half the distance between the quartiles. It is, in this case, 



61-15 - 59-05 
2 



= 1-05. 



G^ 



Clearly the semi-interquartile range is a crude measure of the scatter 
of the readings. When it is large the readings are more scattered, and 
therefore, any one of them is less likely to be correct, than if the semi- 
interquartile range had been small. 

The mean value is sometimes called the expectation. This term arises 
from games of chance. For example, suppose I am playing against an 
opponent at a game of dice, in which my opponent has to give me as 
many shillings as there are pips in a single throw of the die, then the sum 
which I ought to give him before each throw, in order to make the game 
an even one, is three shillings and sixpence, which is my expectation of 
gain. It is not the most probable sum for my opponent to give me ; indeed 
he will never give me this exact sum, since he always pays me in shilUngs 
and not in pence. But at the end of a sufficiently large number of throws 
I shall have received approximately this sum per throw on the average. 

From another point of view the mean will be found to correspond 
to a centre of gravity, 
namely the centroid of the 
curve of distribution of the 
readings. Let the adjoining 
figurerepresent such a curve, 
the number of readings 
which occur within a portion 
AB of the range being represented by the area included between the 
ordinates at A and B. Then the whole area of the curve will represent 
N the total number of readings. That is 




Fig. 4 



N- 



jydx 



where the integration is over the whole curve. Bach reading is repre- 
sented by its position on the a;-axis, and the sum of all the readings is 



The mean is therefore given by 

\xydx 

jydx 

B. &T( 



xydx. 



or 



^jxyd. 
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But this is the expression for the abscissa of the centre of gravity of 
the curve, through which the ordinate above the mean therefore passes. 

(2) SOME ARITHMETICAL SHORT-CUTS 

There are several devices which considerably shorten the arithmetical 
labour of finding the mean. These can be illustrated by the example 
on page 15, near the top. 

Choosing a convenient origin. The occurrence of large numbers in the 
calculations is minimised if an origin is chosen within the actual range 
of distribution, preferably either at one end or in the middle. In the 
example we have in mind, the point 100 grams might be selected, as the 
large number 143 is thus avoided. The calculations would then appear 
as follows : 



Ix 


-16 


-16 


8x 


-12 


-96 


36 X 


- 8 


-288 


85 X 


- 4 


-340 
-740 


119 X 


4 


476 


8x 


8 


64 



540 
-740 



400) -200 

-0-5 
origin 100 



+ 99-5 Mean 



Choosing a convenient unit. Since in this particular case the measure- 
ments are all made at intervals of 4 grams, it is a considerable simplifica- 
tion if this is chosen as the unit. The same calculation then appears as 
follows: 



Ix 


-4 




-4 


8x 


-3 




-24 


36 X 


-2 




-72 


85 X 


-1 




-85 
-185 


119 X 


1 




119 


8x 


2 




16 

135 

-185 






400) 


- 50 
- 125 

4 grams working unit 
-•5 




origin 


100 








+ 99-5 Mean 
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The Summation Method of finding the Mean 

This is only applicable in cases like the present where the readings 
are concentrated at a number of equidistant points. The calculation then 
appears as follows: 



84 grams 
88 „ 


1 
8 


400 
399 


92 „ 


36 


391 


96 „ 


85 


355 


100 „ 


143 


270 


104 „ 


119 


127 


108 „ 


8 
400 


8 
)1950 
4-875 

4 grams per unit 
19-5 
origin 80 

99-5 Mean 



The figures in the second column are the continued sum, from below 
upwards, of the figures in the first column. They are then themselves 
added up, the total being 1950. Now it is clear from its method of 
formation that this total is composed as follows: 

7x8 + 6x 119 + 5x 143 + 4x 85 + 3x 36 + 2x8 + 1x1. 
That is to say, it corresponds to taking the origin at 80 grams, and a 
unit of 4 grams. 

It is instructive to try this calculation from the other end, as it were, 
making the continued sum from above downwards. The reader should 
try other modifications of the idea underlying this summation method. 
For example, how should it be carried out in order to place the origin 
at 84 grams? 

(3) MEASURES OF SCATTER 

Let us consider again the experiment of bisecting a line, described 
on p. 15. Since the lines bisected measured 126 mms., the true half 
lay at 63 mms. The mean of the trials was 60" 13 mms. so that an error 
of 2-87 mms. was made. This error does not in itself however give a 
complete description of the subject's performance; a measure of scatter 
is required. One such is the semi-interquartile range. When we say that 
the semi-interquartile range is 1'05 mms. we mean that half the attempts 
at bisection lay within a range of 2-1 mms., and, since half the trials 
already made come within this range, the probabihty of the next trial 
also coming within it is one-half, apart from practice improvement. 

2—2 
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We must, however, next consider other and more usual ways of 
indicating scatter, although the use of the semi-interquartile range as 
a check is most valuable, since it is a quantity about which glaring errors 
are not likely to be made. 

(1) There is first the "mean variation," a measure much in use 
among psychologists but not one to be recommended. In this the 
deviation or variation of each reading from the mean is written down, 
and the mean of these found, disregarding their sign. Taking the same 
example of bisecting a line we obtain 

Sum of deviations regardless of sign , , , 

M.v. = =r= — i ir-2 2_ = 1.14 mms. 

JM umber oi cases n 

The mean variation is often, as here, about the same size as, or a little 
larger than, the semi-interquartile range, which latter, it will be seen, 
is indeed the median variation, from the median, though this name is 
not used. 

(2) The "standard deviation," generally denoted by a, is by far 
the best measure of scatter to use, for reasons which will gradually 
become clear in the process of studying the subject. Its long title is 
the "root mean square deviation" and it is obtained by squaring each 
deviation, adding the squares all together, dividing by the number of 
readings and taking the square root: 

_ /Sum of squares of deviations 
V Number of cases n 

If this process be carried out the value tr = ] -38 will be found for our 
example. 

This is admittedly a longer process than finding the mean variation. 
It can be simplified by the following device, which should be used 
whenever the mean of the readings is a number involving awkward 
decimals. Instead of taking the deviations from the real mean, here 
60-13 mms., let them be taken from some convenient point, here say 
60 mms., chosen by mere inspection. Proceed now exactly as before, 
squaring and adding the deviations. But from the mean of the squared 
deviations must be deducted the square of the distance between the 
real mean and the convenient point from which the deviations have 
been taken (see example opposite). 

The proof that this then gives the mean of the squares of the real 
deviations is easily obtainable by elementary algebra. 

This shows one very important property of the mean, namely, that 
it is the point where the sum of the squares of the deviations is a 
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minimum: for the provisional mean square is always greater than the 
real mean square, since the correction subtracted is essentially positive. 



Bisection data (page 15) 
Deviations from 60 mms. Squares of preceding 



3-8 


14-44 


2-2 


4-84 


21 


4-4] 


1-4 


1-96 


1-3 


1-69 


1-2 


1-44 


1-2 


1-44 


11 


1-21 


1-0 


1-00 


0-9 


0-81 


0-8 


0-64 


0-6 


0-36 


0-4 


0-16 


0-3 


0-09 


00 


0-00 


01 


0-01 


01 


001 


0-4 


016 


0-5 


0-25 


0-8 


0-64 


0-8 


0-64 


0-9 


0-81 


10 


1-00 


1-2 


1-44 


1-3 


1-69 


1-4 


1-96 


1-8 


3-24 


1-9 


3-61 


2-4 


5-76 




29)55-71 



1-92 provisional mean square 
Subtract 0-13' =0-02 

1-90 real mean square 

Vl-90 = l-38=<r. 

In this section we have, throughout, taken the mean as the central 
value from which the scatter was to be measured. In the great majority 
of cases this is the practical plan ; but if necessary, measures of scatter 
from some other central value could be used. Indeed as has been 
pointed out, the semi-interquartile range is the median of the deviations 
from the median. 

If the distribution of readings is represented by a smooth curve as 
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in the diagram on p. 17, then it will be seen that the mean square 
deviation about the origin is given by the expression 



^jx^ydx 



so that we have the equation, symbolising the above calculation, 

a* = ^ 1 x^ydx — Mean^, 

the integrals being as before over the whole of the distribution. 

In practical work, instead of actually giving the standard deviation, 
it is more usual to quote a quantity called the "probable error," which 
may, for the present, be arbitrarily defined as equal to •67449tT, i.e. it is 
an arbitrary reduction of a. To the meaning of this reduction we shall 
return presently. 

The Standard Deviation about the True Value 

In the section above we defined a quantity known as the Standard 
Deviation about the Mean. From another point of view, however, we 
sometimes require the value of the Standard Deviation about the True 
Value of the quantity which is being measured. This will always be a 
little larger than the former quantity, unless the true value happens to 
coincide exactly with the Mean. This follows from the theorem that 
the sum of the squares of the deviations is a minimum at the Mean. 

Of course if we do not know the true value of what we are measuring, 
we shall be unable to find this quantity exactly, but it can be shown 
that it is approximated to if we divide by w — 1 instead of n after 
finding the sum of the squares of the deviations. 

Let ff be standard deviation about mean a, and a' standard deviation 
about true value A. 

Let %, ttg, ttg, ... a„ be the readings. 

Let Ci, Ca, eg, ... e„ be the true errors of those readings, and let e be 
the error of the mean a. 

Then a - a^-^ A + e - (A + e^) = e - e^, 

a — a^ = e — e^, etc. 
Therefore 

^ = 'S (a - aic? _ S{e- e^f ^ ne^ - 2eS {e^) + S je^^) 
n n n ' 

Now na'^ = S (e/) 

and ne = S (e^), 
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so that we have 

The last term is approximately zero, the positive errors cancelUng the 
negative. 

We have finally 




- 1 

It will be seen that when n is sufficiently large the two quantities 
become practically identical. It is when n is small that the correction 
becomes of importance. This is especially seen from a consideration of 
an extreme case, namely where only one measurement is made. 

Let us say that the one measurement made has the value v. Then 
the mean has also the value ■;;, the deviation is zero, and the standard 
deviation is therefore the square root of zero divided by unity. That is, 
the standard deviation about the mean is zero. The standard deviation 
about the true value is, however, by the above rule, indeterminate, 
being the square root of zero divided by zero. 

The reciprocal of the standard deviation is frequently used as a 
measure of the accuracy of a set of observations. It is clear from the 
above that if the number of observations is small it is the standard 
deviation about the true value which must be used, that is we must 
employ n — I in the denominator instead of n. For otherwise the 
accuracy of a single measurement would be infinite. 

The Standard Deviation of the Arithmetical Mean 

In a section above it was shown that the mean square of the devia- 
tions of a set of readings is a minimum at the arithmetical mean. About 
any other value distant e from the mean this mean square is increased 
by e^- 

Now the mean square deviation about the mean is a^. But the mean 
square deviation about the true value is na^/(n — 1). Therefore the 
expected square deviation of the mean from the true value is given by 

e^ = 5 — ff^ = 7 . 
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The expected square deviation (before experience) is, however, the same 
thing as the mean square deviation (after experience), so that the 
standard deviation of the mean about the true value is therefore 



and that about the mean of the means will be 

The standard deviation of a mean is therefore obtained by dividing the 
standard deviation of the whole distribution by the square root of the 
number of readings. 

The Standard Deviation of Sum or Difference 

Let a; be a quantity whose standard deviation is a^, and y a quantity 
whose standard deviation is o-^; required the standard deviation of the 
sum x + y. Let m^ be the mean value of x and m^ the mean value of y. 
Then any single value of the sum x + y will be of the form 

m^+8^ + my + Sy. 

Moreover the mean of the sum x + y is equal to m^ + m^, so that the 
deviation of the above reading is 

Sa, + Sj,. 

The mean square deviation of the sum x + y is therefore 

o^x+v =S{S^ + hyfjn = S (SJ^yn + S (S^S„)/w + ;S {S/)/n. 
The quantity S (S^Sy) however will be very small if not zero, because 
the factors S^. and 8,, are not connected with one another in any way, 
and their products are as likely to be positive as negative, so that the 
positive values will annul the negative on the average. We have, 
therefore, finally: 

rxy^^ = S(S/) + S(8y^) 
= na/ + JKr/, 

<^x-Hi = y/(pi + V)- 

Exactly the same reasoning holds for the value of o-^_j,, the only 
change being in the sign of m„ and S^, and therefore of S (82,85,), which 
is however zero. The final result is the same 

'^x-^ = Vi<^x' + V)- 

It is assumed in both these formulae that x and y are independent and 
uncorrelated. 
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(4) THE FUNDAMENTAL THEOREM IN PROBABILITY 

In ordinary usage, when we say that an event is probable under 
certain circumstances, we mean that it is more likely to happen than 
not to happen, and by improbable we mean that it is more likely not to 
happen than to happen. 

If we cross-question ourselves as to why we think an event is probable 
we often find that it is because we have more frequently found it to 
happen than to fail under similar circumstances in the past. 

If we wish to apply mathematical treatment to probabiUty we must 
decide on a quantitative measure for it. We do so by using a fraction 
(vulgar or decimal) for this purpose, in such a way that the fraction 
rises and falls with the probability, becoming unity for "certain to 
happen" and zero for "certain not to happen." The numerator of this 
fraction is the number of equally probable ways in which the event can 
happen, while the denominator is the total number of equally probable 
ways in which the event can, under the given circumstances, either 
happen or fail*. 

Thus, for example, consider the probability that with one throw of 
a six-faced die a score of more than four will be obtained. This can 
happen in two ways, namely, by throwing a five or a six. The total 
number of possible throws is six, and therefore the probability is | or J. 

This method of giving a quantitative value to a probability is clearly 
connected with the method adopted in betting. For instance, odds of 
"3 to 1 against" a certain event means that the speaker judges that 
there is only one chance of success to three chances of failure. The 
fraction representing the probabiUty of success is therefore 

1 1 

3+1 4' 

If the experiment with dice mentioned above be actually performed 
a large number of times, it will be found that the number of occasions 
on which the score exceeds 4 will closely approximate to one-third of 
the whole. If therefore we had never seen dice and had no idea of their 
appearance, but were told that a large number of throws always included 
about one-third which were over fourf , we should conclude that the 
probability of obtaining a throw of more than four was about one-third. 

* Cf. however Jeffries and Wrinch, Phil. Mag. 1919. 

t The late Professor Weldon found 106602/(12 x 26306) =0-3377. See Phil. Mag. 
June 1900, article by Professor K. Pearson. 
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This method of finding probabilities, by deducing them from a large 
number of actual experiments, is that followed most frequently in 
practice. 

For example if the points of an aesthesiometer are applied to a 
subject's forearm, with the points distant 3 cms. from one another, 
the average subject will usually recognise that two points are present. 
Occasionally however he will only feel one point. What is the proba- 
bihty that he will answer two? This can only be decided by experiment. 
In an actual case, 150 trials were made at a number of different sittings. 
On 105 occasions the answer two was returned. If the conditions have 
been the same throughout then the probabiUty of an answer two 
under these conditions is J-|^ = 0-7. 

If the probabihty of an event occurring is f then the probability of 
it not occurring is clearly \ — f, for which q is often written. For the 
event is certain to happen or not to happen, and therefore the two 
probabihties must add up to "certainty" that is to unity. Thus in the 
above case the probabiUty of an answer two not being returned is 0-3. 
Similarly the chance of throwing an ace at dice is \ and of not throwing 
an ace is f . The chance of obtaining a head on throwing a coin is |, 
and the chance of not obtaining a head is J. 

Next consider the probability of an event happening twice in suc- 
cession, if its probabihty is f for one occurrence. Take a few specific 
cases first. If two coins are thrown or one coin thrown twice, there are 
four equally likely things than can happen, namely: 



bead 


head 


taU 


taU 


head 


tail 


taU 


head 



The probabihty of getting two heads is therefore \ which, it will be 
noticed, is equal to (\Y. 

Next consider two throws of a six-faced die. There are here no less 
than 36 things which can happen, for whatever value the first throw 
has there are six different throws of the second die which can be associated 
with it: and since the first can also have six values there are 6 x 6 or 
36 combinations possible. Only one of these combinations consists of 
two aces, so that the chance of throwing two aces is -^ or {\Y^ Generally, 
if the chance of an event is p then the chance of it occurring twice in 
succession is -p^. For suppose there are m ways in which it can happen, 
and n ways in which it can fail, then f = mj(m + n). Also, having 
happened once, there are m ways in which the second success can occur, 
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each of which ways might be associated with the first success ; and since 
there are also m ways of this first success happening there are in all 
m? ways of the double success happening. Similarly there are in all 
(m + nY ways of the double event either happening or failing, and 
therefore the chance of a double success is 



(m + nY 

Reasoning exactly similar to the above will enable the reader to 
convince himself of the truth of the following general theorem : 

If there are a number of independent events a,b,c,... etc. and their 
respective probabihties of occurrence are pa, p„, p^, ... etc., then the 
probability of all occurring is the product 

PaXn^PcX ■■■■ 

This may be said to be the fundamental proposition in probabihty. Its 
use will be recognised best by an example. It will be instructive to take 
a real psychometric experiment, that already described in the experi- 
ment on weight-lifting on p. 13. The subject of those experiments com- 
pared each of the weights with the standard 450 times. The results of 
this were as follows* : 



Grams 


84 


88 


92 


96 


100 


104 


108 


No. of answers heavier 





11 


32 


100 


212 


402 


431 



From these the probabihty of the subject answering "heavier" at any 

weight can be calculated, always assuming that the conditions have 

throughout remained the same, by dividing the above numbers by 450. 

We thus obtain the following: 

Grams 84 88 92 96 100 104 108 

Probability of 1 ^^^^ ^^^^^ ^_^^^^ ^,.2222 0-4711 0-8933 0-9578 
answer heavier ) 

Let US now apply our theorem on the combination of probabihties 
to solve the following problem : 

Let the weights 84, 88 grams etc. be each presented once to the subject. 
What is the probability that the lowest answer " heavier " will be at 96 grams ? 

The probabihty that he will answer "heavier" at 96 grams is 0-2222. 
The probability that he will not answer "heavier" at 92 grams is 
1 - 0-0711 = 0-9289. Similarly the probability that he will not answer 
"heavier" at 88 grams is 0-9756, and at 84 grams is 1-0000. The proba- 

* Urban, op. cit. p. 287, Table XI (multiply the values by 450). Or alternatively 
consult Table V, p. 175, of The Application of Statistical Methods to the Problems of 
Psychophysics, by F. M. Urban, Philadelphia, 1908. 
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bility of the combination happening, namely, an answer "heavier" at 
96 and none below, is, therefore 

1-0000 X 0-9756 x 0-9289 x 0-2222 = 0-2014. 
The actual frequency with which this occurred was 0-2125. 

(5) IMPORTANCE OP THE BINOMIAL EXPANSION IN THE 
THEORY OP PROBABILITY 

By a not unnatural hypothesis, which has been widely accepted and 
has proved very fruitful, an error of measurement, such as that made 
in judging the half-way point in a Une in an experiment above, may be 
looked upon as the resultant of a large number of small circumstances, 
each of which sometimes sways our measurement in the one direction, 
sometimes in the other. This hypothesis, combined with the fundamental 
theorem of probabihty just explained, leads to the use of the binomial 
expansion in describing distributions of error. This can be illustrated 
by the following example: — Let us suppose that a quantity which we 
desire to measure has really the value 13| units, but that we are opposed 
in our efforts to measure it by seven "Djinns," each of whom has the 
power of displacing our measurement by one half-unit. Let us further 
imagine that each of these mischievous imps, in an endeavour to prevent 
our making any steady measurement, decides that he will add or deduct 
his half-unit according to the throw, heads or tails, of a coin. Whenever 
we try to make a measurement, therefore, these invisible seven will 
assemble and throw each his coin in the air. If all the coins happen to 
come heads, seven half-units are cunningly added to the 13^, and we 
obtain a measurement of 17. If all come tails, we get 13^ minus 3|, 
or 10. If on another occasion five are heads and two are tails, five half- 
units will be added and two subtracted, giving 

13i + f-|=15. 

An actual test of this is given in the following figures and diagram. 
Seven coins were thrown on 128 occasions, and each time the proper 
number of half-units was added (for the heads) or subtracted (for the 
tails) from 13^. The result was as follows: 

heads and 7 tails occurred 2 times giving a value 10 units 



1 


, „ 6 „ 


8 „ 


■ „ 11 , 


2 


, „ 5 .. 


, 16 „ 


, „ 12 , 


3 


, ,. i „ 


38 „ 


, ., 13 , 


4 


9 »» ^ »7 


, 43 „ 


, „ 14 , 


5 


, „ 2 „ 


, 16 „ 


, „ 16 , 


6 


, „ 1 „ 


2 „ 


, „ 16 , 


7 


, „ „ 


3 „ 


, „ 17 , 



Total 128 
This distribution is shown in Fig. 5. 
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The general resemblance between the diagram, made by throwing 
coins, and previous diagrams which represent the result of psychological 
experiments is not surprising if we consider for a moment what the 
condition of such experiments are. The "Djinns" which oppose our 
efforts to obtain a true value for say the spatial threshold are in- 
numerable. Some are in the fingers of the experimenter, and make him 
press irregularly on the aesthesiometer points. Others cause noises to 
happen in the neighbourhood to distract the subject's attention. Other 
Djinns make the instrument hot one day and cold the next, others live 
in the subject's skin, and quite a lot are engaged in stirring up vivid 
imaginations^in his mind so that he feels all kinds of prickles and 

50 



o 40 
i 30 



S 20 

S 
fi 10 



"01234567 

Number of heads 

Kg. 5. Number of heads in a throw of seven coins in 128 repetitions 
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tinglings which make his judgments on the position of the points as 
erratic at times as is the throw of a coin. 

In the figure, there is shown, in addition to the polygon based on 
experiment, a dotted theoretical polygon to which we now turn our 
attention. Since the probability of obtaining a head at one throw is \, 
the probability of obtaining seven heads in a throw of seven coins is 

{\V or T^. 

The probability that a certain coin will give a tail but all the others 
heads, is (^)^ x (1 — |) which also equals j^. As there are seven coins 
in all, each of which might give the only tail, the total chance of 
obtaining six heads and one tail is y|g. The probabihty that two 
specified coins out of the seven will give tails, the other five giving heads. 



is (i)«x (1-1)2 = 



ITS' 



The number of ways in which the two can be 
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specified is 21. The total chance of obtaining five heads and two tails is, 
therefore, 21/128, obtained thus, 

7! /1\5 /1\5 _ ^ 

5! 2! [2) \2) ~128" 

It will be seen that the above probabilities of obtaining seven, six or five 
heads are the first three terms in the expansion of (| + J)'. The next 
term will similarly be found to be the probabihty of obtaining four 
heads, and so on. We thus obtain the following results: 



Probability of 7 heads 1 in 128 

» ., 6 „ 7 „ 

» ,. 5 .. 21 „ 

» » 4 >> 35 „ 

j» ,, 3 „ 35 „ 

„ 2 „ 21 „ 
„ „ 1 head 7 „ 

„ „ no heads 1 „ 

128 



It is these numbers which are shown by the dotted hne in the figure. 

In general, if p is the probability of an event succeeding, and q of 
it not succeeding, the respective chances of it succeeding 

k,k-l,k-2, ...3, 2, 1 or 

times in k trials are given by the terms of the expansion of (p + q^. 

In n groups of k trials therefore the most probable numbers of 
successes in the various groups are 



n ip'' 



+ kp^-^q + 



Mfc -!)„,_, 



1.2 



pk-2q2 j^ _ _ ^ qk 



For example let the event be throwing either an ace or a six with a 
six-faced die, let six trials be made in each group, and let a thousand 
groups be tried. Then the above expansion is 

Therefore the number of times a throw consisting of only "aces" or 
"sixes" will occur in 1000 trials with six dice each time is most probably 

__- = or once, to take the nearest integer. In the same way we 

get this table: 
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All 6 are "Aces or Sixes" on 
Only 5 „ „ „ 

»> ^ >» >» »> 

If " „ ,1 „ 

„ 1 is an ace or six 
None are aces or sixes 



1000 throws of six dice each time 
1000 



729 



or approx. 1 occasion 



12000 
729 " 


„ 16 occa 


60000 
729 " 


82 „ 


160000 
729 " 


220 „ 


240000 
729 " 


329 „ 


192000 
" ■ 729 " 


264 „ 


64000 


88 „ 


729 " 



(Mode) 



1000 



„ 1458000 „.^ 

— 729~^^^ 

M 2000 „ 

^"'^" = 1000=2 



Here the mode is at two. 

What is the meani In calculating it we shall take the exact fractions 
in the above table, not the approximate integers. The mean is then 
obtained thus: 

6x 1000-^729= 6000 -H 729 

5x 12000 H- 729= 60000-=- 729 

4x 60000+729=240000^729 

3 X 160000 -H 729= 480000 H- 729 

2 X 240000 H- 729 = 480000 + 729 

1 X 192000 + 729 = 192000 + 729 

Ox 64000 + 729= nil 

Here, therefore, the mean coincides with the made. 
In general we have from the binomial expansion 

Mean equals 

kp'' +{k-l) kp^-->^q + {k-2) ^^1^ p*-2?2 + 



= kp\p^-^+{k-l)p^-^q 



(k-l){k~2) ^ , , 

= kp {p + q)''-^ = kp since p + q = 1. 
The m^de can also be deduced from the expansion. The first term in 
the expansion, p, corresponds to k successes, the second term to ^ — 1 
successes, and so on, so that the term corresponding to any number m 



of successes is 



k\ 



M™ff 

ml {k — m)\^ 



ifn ftk — Ttt 
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This term is made from the preceding term by multiplying the latter by 

m+ 1 q 

k — m p 

and the succeeding term is made from it by multiplying by 

m q 

k — m+ 1 p' 

It will therefore be the largest term if 

m + 1 q , m q 

• — >■ 1 > . =- 

k — m p- k — m + 1 p' 

whence qm + q> pk — pm, pk — pm + p> qm, 

qm + pm > kp — q, kp + j) > qm + pm, 

m > kp — q, kp + p > m, 

so that kp — q < m < kp + p. 

The greatest term, therefore, is that which corresponds to a number 
of successes between kp — q and kp + p. The range thus indicated is 
unity since p + q= I. In the binomial expansion therefore the mean 
and the mode agree to an integer. 

Standard Deviation of the Binomial Expansion 

For the binomial expansion the standard deviation is equal to ^^{kpq). 
This can be proved as follows: 

In the expansion of (p + qf the first term p^ represents the proba- 
bility of the specified event succeeding k times. The mean number of 
times it succeeds is hp, so that the deviation is k— kp and the first 
term in o-^ is jj* x (^ — kpY- Similarly the second term kp^-'^ q of the 
binomial represents the probabiUty oi {k— 1) successes, and the devia- 
tion is therefore (k — 1) — kp, so that the second term in a^ is 

kp^-'^ q{k — \ — kp). 
Remembering that k— kp = kqwe get the following expansion for a^ 

(^ = (kqf pi' + (kq - \f kp^--^ q+{kq- 2)^ ^^l~^'' p^-^ q^+ ... 

= kY if + qf - 2ifcY (p + qf-^ + ^Tif^ ? + 4 ^ ^^ ~ ^) y«»-2g2 + .. I 

= kY - 2AV + kq {p*-i +2{k-l) p*-^ q + ...} 

- kY + h {{p + qf--' +(k-l)q(p + qf-^ 

-kY + h{'^ + (^-'^)q) 

- kY + kq{l + kq- q) 
= %{!-?) = kpq, a = '>^{kpq). 
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(6) THE NORMAL CURVE OF ERROR 

We have seen that a binomial expansion gives a cocked hat figure 
very like the actual diagrams obtained by experiment. In the imaginary 
example which we considered, seven Djinns added or subtracted each 
a half unit to or from the quantity 13| which we were trying to measure. 
We then obtained measurements extending from 10 to 17 units, but 
always at the exact units. 

In practice if we were handling a not easily measurable quantity 
we might well find our measurements range over this distance, but 
unless they were constrained to do so by some pecuharity of the method, 
they would not always occur at exact units, but at any distance. This 
case is covered if we imagine the number of Djinns (the factors of 
accidental error) to be much increased but the influence of each one 
made less : thus there might be 7000 Djinns each adding or subtracting 
a mere fraction of a unit, or ultimately an infinite number of them, each 
adding or subtracting an infinitesimal amount, just as an infinite number 
of the tiniest errors (we may well imagine) account for the variations 
in our experimental readings. 

What would the binomial expansion then become? That is, what 
form does the expansion of (| + 2^)* take when h becomes infinite and 
the terms are not unit distance apart but only an infinitesimal distance 
dx'i The whole range covered by the expansion is kdx, and the term at 
either end which occurs when all the errors are either positive or negative, 

is at a distance Tc -^, from the middle, so that each elementary error is 

now dxjl just as formerly it was half a unit. 

The term which corresponds to I errors being positive and h — I 

negative is p ^ ^^y {\f-n\l{l\ {k - l)\} (1), 

and the net error, the abscissa of this point, is 

X == {I - {k - I)} dx/2 (2). 

The next point, distant dx, is that corresponding to I + 1 positive and 
k — I — 1 negative errors, giving an abscissa of 
{l+l-{k-l-l)} dxj2. 
Its probability is 

P + dP = afk\/{{l + l)\(k-l-l) !}. 
The ratio of these is 

P + dP k-l dP dP k-2l-l 

P^'^l+l' ^ P' • P~ l+l ■ 

B. & T. 3 
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But ' = J- + o . from eqn. (2) for x above, so that 

dP 2 {2x + dx) 

T ~ " Ix + {h + 1) dx' 
Now we are going to make k, the number of atomic errors, equal to 
infinity, and we can therefore neglect the 2 in A + 2 and write 
dP 2 (2a; + dx) _ 4a; 2dx 



P 2x + kdx 2x+kdx 2x+kdx 



.(3). 



Therefore -=— , which is the quantity we require before we can integrate 

(tX 

and obtain the equation to the continuous curve to replace the binomial 
cocked hat, is given by 

_ldP_ 4a; 2 . 

P dx 2xdx + kdx^ 2x + kdx 
We must now consider the quantity kdx which gives the entire 
extent of scatter, the whole range. The number of errors k is infinite, 
we have assumed, and dx is infinitesimal. The range kdx may then 
either be finite or infinite. If we assume it to be finite, then kdx^ will 
be infinitesimal and the above equation becomes 

P dx 2xdx ^ 2x + kdx ^'1^'^"^' 

dP 
when dx becomes infinitesimal. Or -p = — infinity, except when P = 0. 

In this case the probability falls off infinitely quickly from the mean, 
and this is therefore a case of no scatter at all. If therefore we postulate 
an infinite number of elementary errors, we must allow a possible range 
of scatter of infinite extent, the only alternative being no scatter at all. 
But although the possible range is infinite, it will be found that at 
infinity the probabiUty is infinitesimal, that is the larger errors do not 
in practice occur. 

We take therefore , , 

Kdx = 00 , 

k {dxf = a finite quantity*, 

which we shall write = 4a*. 

{a will turn out presently to be our previous acquaintance the stan- 
dard deviation.) We then have from eqn. (4) 

I dP 4^ x^ 

~P~E~ 2xdx + ^a'^~^ ^^>' 

* Here again the alternative ought to be investigated. 
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and on integrating, 

log P = — jj-2 + constant, 

or P = Ce-^V(20 (6). 

This equation gives the probability P of any value of the error x 
occurring, and not only of any integral value, as the binomial does. 
The value of the constant C will be found presently, and it will also 
be shown that a is, as has been already asserted, the same standard 
deviation which we already know in another guise. The general shape 
of such Normal Curves is shown by the example in Fig. 6, p. 43. 

Some properties of the Normal Curve 

In the curve P = Ce~* '^^^ \ P is the probability of the occurrence 
of an error x. Let us consider of what order of magnitude the quantities 
P are. In the first place we remember that the curve was found as the 
limit of the expansion (^ + |)*, when Jc was made infinite. This curve 
however flattens out more and more as k is increased. 

For example 

(I + 1)^ = i + i + i 

So that clearly when Jc becomes infinite, the ordinates of the curve, as 
we have so far considered it, flatten out so that they are all infinitesimal. 
In other words the probabiUty P of any exact value of x occurring is 
really infinitesimal though it varies from one x to another x. C must, 
therefore, be infinitesimal. Let us try putting 

C = C'dx, 

so that if C is of the same order of smallness as dx, the new quantity C 
will be a finite constant. We have then 

p = (7'e-^V(2^')<Za; (7). 

This means, in geometrical language, that instead of using a curve whose 
ordinate P measures the probability of the occurrence of x, we had 
better use a curve where this probability is measured by the area of an 
elemental rectangle contained by two ordinates enclosing x, and distant 
dx from one another (cf. Fig. 4, p. 17). Such a curve will be similar in 
shape to the former but oi finite ordinates. 

We have assumed in the above argument that C as there defined 

3—2 
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is a finite quantity, that is that C is of the same degree of smallness 
as dx. That this is so may be shown by proceeding next to find the 
actual value of C" in the following manner. 

It is clear that the sum of the probabilities of all errors must equal 
unity, for at any one measurement it is certain that some error or other 
must occur, if we include zero error in the mathematical sense. That is 

00 

SP=1. 

— 00 

Substitute for P from the equation (7) above and replace the sign of 
summation by integration. We thus obtain 



C'f 



C r e-^'/i^'')dx=l (8). 

J —00 

Write x^/{2a^) = z^ xdx = 2aHdz; 

then the integral becomes 

e-^'^^= C'c7V2.2B* = C'aVi^) ......(9). 

Whence, since this equals unity, we have 

C = l/(c7 V^ ■\/tt), a finite quantity (10). 

So that we can now write 

av'(277)^ ^^^^• 

The ordinate of the curve defined on p. 35 is «/ = P\ix so that its 
equation is 

" ~ o- V(2t) 

Such a curve is called a probability curve. The probability of the 
occurrence of the value x is given by y^dx and the probability of x falling 

between a and 6 is given by ydx. The total area I ydx equals 

J & J — oo 

unity. 

If N measurements were made, and the errors were distributed 
according to such a curve, then the most probable number of times a 
deviation x occurred would be ^Px- ^ curve 

N 
a-v/(277-)' 

is similar to a probability curve, but each ordinate is N times as tall. 



y = :r4=ve-''^/<='^) (12). 



2/ = — 77^.e-»=V(2<r') 



* The reference is to Integral B in Appendix II giving the values of a nnmber of 
integrals of general use in the theory of probability. 
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It is a distribution curve and its total area is not unity but N, and the 
integral from a to 6 represents not the probability of x falling between 
a and b, but the most probable number of times it would fall in this 
range out of N trials. 

Since this curve is symmetrical, the mean, mode and median coincide. 

Let us find the standard deviation of such a set of measurements. 

The sum of the squares of the deviations is given by 

Ndx _a;2/(2<r2) ^ ^2 



(write x^ = 2ctV, xdx = 2(^zdz) 

a V (■^T'') J -00 za yZ yn 

The mean square deviation is Na^jN = a*. 

The root mean square deviation, or standard deviation, is therefore a, 
so that we were justified in using this letter on p. 34 when we wrote 

h {dxf = 4o-2. 
The quantity cr can be shown to have another significance in the normal 
curve, namely it is the distance from the centre to the point of inflection 
or point where the curve changes from convex to concave. The proof 
of this statement is as follows. At a point of inflection of a curve, the 
value of d^yjdx^ is zero. In our case 

^l'^''-;iKVi>)\'""""'-i''~"'"^}- 

If this is to equal zero then a^ must equal a^. 

Finally, if we suppose the area enclosed between a distribution curve 
and the axis of x to be spinning round the axis of y, then the moment 
of inertia is such that we can consider the whole weight concentrated 
equally at the two points of inflection. In other words, a is the "radius 
of gyration." For the area (or weight) of each vertical elementary 
column of the curve is ydx and its distance from the axis of gyration is x. 

The moment of inertia is therefore 

j_„o-V(2'r) 
* See Integral C, Appendix II, p. 202. 
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Write 35^ = 20^2^ and the integral becomes 

^ X C* = a^N. 
■y/TT 

Therefore the N can be considered as concentrated at a distance cr from 

N 
the centre, or -^ at each point of inflection. 

The Relation between Mean Variation and Standard Deviation, 
in the case of Normal Distribution 

The distribution is theoretically symmetrical and the mean of the 
positive variation ought to equal the mean of the negative variation, 
apart from sign. We have therefore 

S{x) 

M.V. = ^p-, 

S indicating sum from to oo of the variations x. Remembering that 
the frequency of occurrence of each x is 

-^e-'/i^''')dx 

we replace the sum S by an integral, obtaining 



N [ 



"xe-«V(2^") 



dx. 



rV(27r)Jo Nj2 
Writing x = y/2a$ and using integral A of our list of definite integrals 
(indeed this expression integrates) we obtain 



O-S/TT Jo 

■v/2 . 

The numerical value of ^ is -8 approx. 



2V2 1 ^2 
-■ -5- o- X s = -V "'• 



The Relation between Probable Error and Standard Deviation, 
in the Case of Normal Distribution 

The probable error was defined on p. 22 as an arbitrary reduction 
of the standard deviation, viz. 

p.E. = -6745(7. 

In the case of Normal Distribution a physical meaning can be given to 

* See Integral C in Appendix II, p. 202. 
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this quantity. Consider the number of cases which fall within the limits 
of the range ±-6745a. This number is 

aV(27T)Jo 

Values of the probability integral have been calculated and tabulated 

for all limits, and if such a table be consulted* it will be found that this 

N 
quantity has the value -^ . That is to say, the probable error gives the 

range within which one-half the cases may be expected to fall. It is 
more instructive to remember that half the cases may be expected to 
fall outside ± p.e. 

With skew distributions this meaning ceases to hold, and for these 
<7 should be used. Indeed it is in general a better quantity to employ. 

(7) ON FITTING A NORMAL CURVE TO DISTRIBUTION DATA 

Consider the experiment of bisecting a line of which the data are 
given on p. 15. The histogram of Fig. 3, p. 16 represents these data, 
and shows the density with which these points occur in each part 
of the range. We wish to replace this stepwise figure by a smooth 
Normal Curve which will give us at each point of the range the theoretical 
proportionate density of the bisection points at that spot, and we want 
this Normal Curve to be the most probable which can be based on the 
given data. 

The equation of the required curve is 

" aV(27r) 
where N is the number of experiments made, x is measured in mms., 
and {x — a) is a quantity measured from some central point of the data. 
In the theoretical curve (which is symmetrical) a is the point where 
mean, mode and median coincide. In the data however the mean is 
60'13 and the median 60 while the mode is unknown. What value shall 
we give to a so as to obtain the best fitting curve? 

To answer these questions it is necessary to consider what we mean 
by best fitting curve. 

The meaning of "best fitting curve" is really the same as the 

meaning attached to the phrase "most probable theory." By the best 

theory of any set of data we mean the theory from which the observed 

data could have chanced to spring with a greater probability than would 

* E.g. the first table in Pearson's Tables for Statisticians and Biometricians, 
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be the case with any other theory. Take a simple example. Suppose 
a bag is known to contain a large number of equal-sized balls and nothing 
else: but the colours of the balls are entirely unknown. Experiments 
have been carried out to learn something about their colours, each 
experiment taking the form of extracting one ball, noting its colour, 
and replacing it. Ten such experiments have been made and the results 
are as follows: 

5 black balls, 

3 white balls, 

1 led ball, 

1 green ball. 

What is the best theory of the composition of the bag? 

The best theory on the facts as given, is that the bag contains 
^ths black balls, ^ths white balls, J^th red balls, and ^th green balls. 

Suppose however that another theory was advanced, namely that 
the bag contained ^ths black balls, ^ths white balls, and ^th each of 
red, green, yellow and blue balls. How should these two theories be 
compared? 

The proper plan is to find the probabihty, on each theory, that 
ten dips would result in what was actually observed. Then that theory 
is best for which this probability is greatest. Let us take first the theory 
which we assert to be the best. The probabihty, on that theory, of 
drawing five black, three white, one red and one green ball (the order 
being immaterial) is 

The corresponding probability on the other theory is 

The former theory is therefore the better of the two. If from other 
sources we knew however that the bag did contain balls of six colours, 
then the first theory would be ruled out, and the second theory would 
have to be compared with other six-colour theories. The reader might, 
for example, compare it with the theory that the bag contains ^Ajths 
black, Y^ths white, ^ths red and -ji^th each of green, blue and yellow 
balls. 

Turn now to our actual experiments on bisecting a line. The 29 
experiments are like 29 dips into a bag, resulting in the numbers on 
p. 15 being drawn. From our general consideration of the problem 
we beUeve that the numbers in the bag form a continuum, that is we 
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think that another bisection mark might fall anywhere within the range, 
and not only at points already struck, though of course our measurements 
are only being made to the nearest tenth-millimetre. Secondly we think 
that a curve of the form 

y= -JL e-a;V(2<r'') 

^ aV(27r) 

will express this continuum. The problem is, which curve of this form 
is the best fitting one. This is decided in exactly the same way as was 
the above example of the coloured balls. For each curve find the 
probability that the actually observed distribution may have arisen 
from it by random sampling. Then that curve is the best for which this 
probability is greatest. 

The actual process of trial and error giving now this now that value 
to a and a would take too long, and we find the optimum values by the 
usual process of the differential calculus. Let us do so in a quite general 
manner, taking n values (instead of 29 as here). 

Given a distribution following the law 

«= I -(a:-a)V(2<r'') 

what is the probability Q that the n special values 

^1} ^2> "^J ■^4> ••• •^>^> ••• •^B 

will be obtained in a set of n trials (the order being immaterial) ? The 
probability of Xt, is 7„ 

aV{2'T) 

and the required probabihty is equal to the continued product of a 
number of such expressions, in which A takes the values 1 to w succes- 
sively, multiplied by n ! because the order is immaterial. 
We therefore have 

This probability we wish to make a maximum by choosing the best 
values of a and a. We must therefore put 

dQjda = 

and dQ/da = 0. 

Now f= ^! f_^r e-^(-.-»)W) X ^J^h^\ 
da \ctV27t/ <^ 
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and since this equals zero we must have 

S (x^ -a) = 0, 
.'. S {x^) = S (a) = na, 
a = S {x\)jn. 
That is, a must be the mean of the observations. Turning now to the 
second equation we have (omitting from the first the factor dx/-\/{2TT), 
which is independent of a) 



All 
Therefore 



S(x^-am2a') 



-S{x^-a)y2ff' 



S (X), - af - na'^l = 0, 



no^ = S (X), — of, 
G^ = S {xi, — a)^/n, 



i.e. a is the standard deviation of the readings. 

We find therefore that to obtain the best fitting curve we must 
find a the value of the mean of the observations, and a the value of the 
standard deviation. In the case in question 

a ^ 60-13, 
ff= 1-38, 

so that y ■■ 



-(a;-60-13)V(2xl-382) 



l-38v'(2w) 

where x is in mms. This curve is drawn on the adjoining figure. Calcula- 
tions, such as those required to find a number of ordinates of the above 
curve for the purpose of drawing it, are best performed in tabular 
fashion. For example, the present calculation might be arranged as 
follows, and the reader should calculate one or two ordinates for practice 
by this means. 



(a) 


(6) 


(0) 


(d) 


(e) 


(/) 




a; -60-13 


2 X l-SS" 


Column c 
X loge 


Reciprocal of 
Antilog. column d 


29 




Lolunm c X J gg^^g^j y 















This arrangement is suitable for an approximate calculation using 
a ten inch slide rule, from which the logarithms are also taken. The 
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accuracy thus attained is quite as great as the extent of the experiment 
justifies. If logarithm tables, or calculating machines, were to be used, 
somewhat different tabular arrangements would be required. 

Some of the calculation in the above table has, however, once and 
for all been done and printed in tables of the probability curve. The 
best of these for our purpose is Sheppard's table, printed as Table II 
n Professor Pearson's Tables for Statisticians and Biometricians. 




56 



57 58 59 60 61 

Millimetres 
Fig. 6. A normal curve fitted to the bisection data. 
The histogram is also shown 

Fitting a Normal Curve to the Bisection Data with the 
Aid of Sheppard's Tables* 



I 


II 


III 


IV 


V 


X in mms. 


a;' =a;- 60-13 


Sheppard's x 
a;71-38 


Sheppard's z 


y=:29z/l-38 


6013 

60 

59-5 

59 

58-5 

58 

57 

66 




013 

0-63 

113 

1-63 

2-13 

313 

413 




009 

0-46 

0-82 

1-18 

1-54 

2-27 

2-99 


•3989432 

•397 

•359 

•285 

•199 

•122 

•030 

•005 


8-38 
8^34 
7^54 
599 
4^18 
2^56 
063 
Oil 



The other half of the curve can be drawn by symmetry. No interpolation is here used in 
the tables. A slide rule, or Crelle's Calculating Tables, can be used for the multiplications. 

* Of course an experiment of only 29 observations does not justify any curve fitting 
at all, as the accuracy is not sufficient, the sample not being large enough. But a short 
example is necessary for explanatory purposes in a text-book. 
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We are at present only concerned with Sheppard's first and fifth 
columns headed x and z respectively. These are connected by the 
relationship , 

that is they give a curve with N =1 and cr = 1. The x of Sheppard's 
table is, therefore, obtained from our x by division by a, and his z needs 
to be multipUed by iV/cr. Our calculation then takes the form shown 
above Fig. 6. 

It must be clearly understood that there are two parts in any curve 
fitting problem. Firstly there is the decision as to what kind of curve 
is to be used, and secondly the finding of the best fitting curve of this 
kind. If it were merely a question of getting a curve to fit the particular 
data of one problem, then a curve could always be drawn to fulfil the 
conditions exactly. It is only because general considerations dictate that 
the curve shall be of a certain kind, that an exact fit cannot be obtained 
and the best fit has to be found. 

(8) THE METHOD OF LEAST SQUARES 

The principles adopted in the last section are those which underlie 
the Method of Least Squares, which is employed to find the best 
solutions of a set of hnear equations which are more numerous than 
the unknowns, and slightly inconsistent with one another. To illustrate 
the principle take three equations for two imknowns, 

ax + by + c =0, 
a'x + b'y +c' =0, 
a"x + b"y + c" = 0. 

The quantities a, a', a", b, etc. having been measured, the equations 
are found not to come to zero for any values of x and y, but to give 
"residuals" v, v', and v" thus: 

ax + by + c = v, 
a'x + b'y + c' = v', 
a"x + b"y + c" = v". 

Now the presence of these residuals v may be assumed to be due to 
numerous small errors in the coefficients a^ b, c, and the distribution of 
any v, were one of these equation-observations to be repeated many 
times, will, it may be assumed, be a Normal Curve. The probability 
of occurrence of v will therefore contain as chief factor e""^ and the 
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probability of occurrence of v, v' and v" (presuming them independent) 
will contain the factor 

e-"' X e-'^' X e-"" or e-'SW 
where >S (v^) = v^ + v'^ + v"^. 

To make this probability a maximum we must make ;S (v^) a minimum, 
whence the name Least Squares. The conditions for S (v^) a minimum 
are 

dS/dx = dSjdy = 0, 

and also (though in practice it is not necessary to find them) the second 
differentials must be negative. We get at once 

^ i.{ax + by + cf + (a'x + b'y + c'Y + (a"x + h"y + c"f\ 
= 2 (a2 + a'2 + a"2) a; + 2 (a6 + a'V + a"h") y + 2{ac+ a'c' + a"c") = 0, 

and similarly for y. 

The two equations thus reached are called Normal Equations. There 
will of course always be as many of them as there are unknowns. If the 
original equations were not of equal rehability or weight they must of 
course first be multiphed by their weights. The rule of Least Squares 
is therefore: To obtain the Normal Equation for any unknown x, multiply 
each equation by its weight w, and by the coefficient ofxin that equation, and 
then add all the equations together. The Normal equations in our example 
are thus: 

S (aho) x + S (abw) y + S (acw) = 0, 
S (abw) x + S (b^w) y + S (bcw) = 0, 
whence unique values of x and y can be found. 



CHAPTER III 

THE PSYCHOPHYSICAL METHODS 

Experimental methods and mathematical processes — ^The method of limits — ^The 
method of average error — ^The constant method — Difference thresholds and the 
probability of a judgment of a certain category. 

(1) EXPERIMENTAL METHODS AND MATHEMATICAL PROCESSES 

The experimental determination of absolute and difference thresholds 
or limina is compUcated and difficult. A considerable number of physical 
and psychological, and, it may be added, mathematical, factors is 
involved, of varying relative importance in different cases. The result 
is that different methods of procedure have been found most suitable 
for different cases. These methods have been traditionally grouped 
under three (or four) distinct headings, and called the Psychophysical 
Methods. They are: 

(1) the Method of Limits (Method of Minimal Changes), 

(2) the Method of Average Error (Method of Production), 

(3) the Constant Method (Method of Eight and Wrong Cases). 
A fourth method is generally added to the Ust, viz.: 

(4) the Method of Equal Appearing Intervals, or Method of Mean 
Gradations, but this is really no new method. It owes its special name 
to the nature of the task which it fulfils, viz. the determination of equal- 
appearing {uehermerklich) sense-distances as distinguished from just 
perceptible (ebenmerklich) sense-distances. The method which it employs 
falls under one or other of the first three headings. 

There are two things, essentially different from each other, which 
are commonly confused under this one heading "psychophysical 
methods," namely the methods of experimenting in order to obtain 
data, and the processes of calculation after the data have been collected. 
To avoid this confusion the words "method" and "process" will be 
employed throughout this book in the way indicated by their use in 
the above sentence. It is urged that their general adoption would be 
advantageous. The cause of the confusion is to be found in the historical 
development of the subject, for with each of the methods of experi- 
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menting a process of calculation was associated and the one name was 
given to both. 

The experimental methods of determining thresholds may be 
divided into two main groups: 

1. Methods in which the stimulus is altered continuously until in the 
opinion of the subject it fulfils some given condition. 

2. Methods in which various values of the stimulus are Separately 
submitted by the experimenter to the subject who expresses a 
judgment on each of them, classifying them into two or more 
categories. 

The methods belonging to the second group may in turn be classified 
according to the order in which the stimuli are submitted to the subject. 
The second group is thus subdivided as follows : 

2 (a). Methods in which the order of succession of the stimuli is 
irregular, non-consecutive. 

2 (b). Methods in which the order of succession of the stimuli is 
consecutively (i) ascending or (ii) descending. 

Under 2 (a) comes the Method of Right and Wrong Cases. Under 
2 (6) come the Method of Minimal Changes and the Method of Serial 
Groups*. The latter is a special case of the Method of Minimal Changes, 
in which each value of the stimulus is submitted a number of times 
before the next consecutive value is submitted. A corresponding method 
under 2 (a) is the Method of Non-Consecutive Groups, in which groups 
are taken in an irregular order. It is customary in the Method of Serial 
Groups to introduce among the stimuh an equal number of " catch " cases, 
in which no stimulus (or stimulus difference) at all is given. This might 
also be done in other methods. Each method can be further subdivided 
according as the subject is or is not warned beforehand of the kind of 
succession to expect. 

The differences which result from the use of these various "methods" 
are clearly due to their different psychological effects upon the subject. 
But we have also to take into consideration the differences in the 
"processes" of calculation. These, of course, are purely mathematical 
and ignore the subject altogether. 

* We adopt here the convenient name suggested by G. M. Stratton who first described 
the method {Psychol. Rev. 1902, ix. pp. 444 — 447). It had, however, been independently 
discovered and employed by W. MoDougaU four years previously during his stay in 
Murray Island (Rep. Camhridge Anthrop. Expedition to Torres Straits, Cambridge, 1903, 
n. pp. 190—193). 
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(2) THE METHOD OP LIMITS 

In using this method for the determination of difference limina the 
following mode of procedure is adopted. The variable stimulus V is 
first made equal to or slightly larger than the standard S, and then 
increased step by step by small increments until the subject finds it 
just perceptibly greater than S. V is increased still more, and then 
gradually diminished until it just ceases to appear greater than S. The 
mean of the two values of V — S thus obtained is the upper difference 
limen, ?„. 

Four values, instead of two, might also be obtained, viz. for (i) V just 
not perceptibly greater than S, (u) V just perceptibly greater than S, 
(iii) F just perceptibly greater than S, (iv) F just not perceptibly greater 
than S; (i) and (ii) being for ascending values of V, (iii) and (iv) for 
descending. T„ will be the mean of these four values of F — S. 

The lower difference limen, Ti, is obtained in a similar way. Both 
limina may be obtained in the same series of experiments by the 
"method of complete descents and ascents." 

A series of determinations of each limen is made and the average 
taken. A measure of scatter (mean variation, say, or standard deviation) 
is also calculated. Absolute thresholds may also be found by this method. 

The number and size of the increments employed must be adjusted 
to the particular conditions of the experiment. The subject of the 
experiment should be given a certain amount of preliminary practice 
before being started upon the work, and introspective reports should be 
asked of him. 

Among the various possible sources of error which deflect the 
subject's judgment both in this and in the other psychophysical methods, 
two are of special importance. They arise from the temporal and spatial 
arrangement of the compared stimuli, and are called the "time error" 
and "space error" respectively. Thus, in a determination of the 
difference limen for sound-intensities, the two stimuli, S and V, cannot 
be presented simultaneously. One must precede the other, and in this 
way it may produce a slight degree of fatigue which causes an over- 
estimation of the other, or it may produce the reverse effect of sharpening 
the attention to the second. Thus a time error arises. Again, in experi- 
ments with brightness-intensities or visual extents, where the stimuli 
can be presented simultaneously, a space error arises from the fact that 
the one stimulus must be presented either to the right or to the left of 
the other stimulus, and the subject's judgment varies accordingly. In 
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experiments with lifted weights both sources of error may be involved. 
These so-called constant errors may be approximately neutralised by 
arranging that in the course of the experiment the standard shall precede 
the variable or stand to the right of the variable in half the cases and 
follow or stand to the left in the other half — the time or space order, 
or both, being of course changed quite at random in the successive 
limen determinations. A better plan is to evaluate the hmen, and its 
scatter, for each time and space order separately. This gives us the 
values of the time error and the space error, which are of interest for 
their own sakes. Fechner's theory of these errors and their measurement 
is based upon the assumption that the time and space orders of the two 
stimuli, S and F, exert an influence upon the result which is equivalent 
to a definite increase or diminution in the stimulus-value of one or the 
other, and thus increase or diminish the value of »S ~ F by the amount 
of the time error e^, or that of the space error e^j, or by the sum or the 
difference of these two. The time error is positive or negative according 
as the effect of the time order is to increase or diminish the apparent 
value of the fast-presented stimulus. The space error is positive or 
negative according as the effect of the space order is to increase or 
diminish the apparent value of the left-hand stimulus. 

Four principal cases of time and space order are possible, and are 
conventionally numbered as follows*: 

I Standard presented first and to the right, 
II „ „ second 

III ,, „ first „ left, 

IV „ „ second 

Employing these numbers as suflixes, we have the equations 



Tn, =T„-e^ + e,, 


T,^ =r, + ei-62, 


Tu,, =r„ + 61 + 62. 


2'i„= Ti~ej^-e2> 


T„,,, = T,-e,-e,, 


Ti,,rTi + e^ + e^^ 


T.,,-T^ + e^-e^, 


Ti^^=^T^-e^ + e^, 


whence 




rp J^Uj + -^Miv -'•'II + -'"III 

^"2 2 


Tui + -^ Mil + ^"III + ^■^Vf 

4 


and a similar expression holds for Tf, 


and 


26i = y^,! - T„^ = T^jy - ^«i„ 


= Ti^ - T,jj = Tjjjj - Ti^^ , 


26, = T„^ - T„„, = T„„ - r„,^ 


= ^^i„-^^. = ^v-^r 



* G. E. Miiller, Die Oesichtspunkte und die Tataachen der psyehophysischen Methodik, 
1904, pp. 67, 71. 

B. & T. 4 
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The errors due to expectation, habituation, fatigue, etc., are neu- 
trahsed or at least reduced to a minimum by determining the limen by 
means of both ascending and descending values of F and averaging, and 
by other special precautions in applying the method. 

The Mathematics of the Method of Limits 

It is to Professor F. M. Urban that we owe the most complete dis- 
cussion of the mathematical foundations of this method*. 
Let the variable stimulus V have the values 

*1) *2' *3j ••■ *n- 

Then for constant experimental conditions there will be. Prof. Urban 
assumes, for each stimulus a probabihty that a certain judgment (say 
Greater) will be given. Let these probabilities be 

Vi> P2. Ps. ••• P™- 
If the stimuli were arranged in order of magnitude beginning with the 
smallest, then these probabiUties will also be so arranged. Let us 
further write q for the probabihty that this judgment will rwt be given. 
Then for each q and f we have 

l-'p = q. 
Consider ascents first. A stimulus s is noted as a just perceptible 
point if the answer greater is given at s and was not given at any lower 
point in that series. This is a compound event, and its probabihty is 
the product of a number of q's, for the lower stimuh where greater was not 
the answer, and one f for the stimulus s itself where the answer greater 
is returned. Let P represent the probabihty that s will be noted as one 
reading of the just perceptible point, then we have the set of equations 

Pi = Vi, 
^3 = MaPs. 



Pn = ili2---ln-l'Pn, 

and the mean of the just perceptible points will be 

T = s^Pi + s^P, + S3P3 + ... + s„P„ = S (sP). 

The standard deviation will be given by 

a^ = S {(s - Tf P} = S (s^P) - T\ 

* See "Die psyohophysisohen Masamethoden,'' Archiv f. d. ges. Paychologie, 1909, xv. 
p. 289; "On the Method of Just Perceptible Difierences," Psychol. Rev. 1907, xiv. p. 244; 
The Application of Statistical Methods to the Problems of Psychophysics, Philadelphia, 1908. 
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An example will make the whole of this much clearer. Weights of 
84, 88, 92, 96, 100, 104 and 108 grams were compared, by lifting, with 
a standard weight of 100 grams, and the replies given, which referred 
to the variable weight, were heavier, equal and lighter. They were 
recorded as follows : 



34 


88 


92 


96 


100 


104 


105 


e 


1 


h 


h 


e 


h 


h 


I 


h 


e 


h 


h 


h 


h 


I 


1 


1 


h 


h 


h 


h 


1 


I 


I 


e 


e 


h 


h 


1 


1 


e 


h 


h 


h 


h 



and so on for 400 rows, the letters h, e and 1 being the initial letters of 
the answers given. The five rows shown in extenso give five readings of 
the just perceptibly heavier point, vi^:. 92, 88, 96, 104 and 96 grams. The 
400 of these obtained from the 400 rows were distributed as follows 
(Urban's Subject I): 

Grams 84 88 92 96 100 104 108 

Frequency 7 30 76 106 169 12 400 in all. 

The mean of these points is 100-36, which is therefore the directly 
observed threshold of just perceptible positive difference. (It should be 
noted that a time error is included which accounts for the nearness of 
this point to the standard.) The standard deviation is 4-35 grams. 

The frequency with which the answer heavier was given at each 
stimulus can also be found from the records. The application of Urban's 
Formula, based on these frequencies p, is then as follows: 
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* See explanation in text to follow. 
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The calculated point of just perceptible positive difference is therefore 
100-04 grams and the standard deviation 4-56 grams. It will be seen 
that a working origin has been taken at 100 grams, and a working unit 
equal to 4 grams, to simplify the arithmetic. The column headed 
q products is the continued product of the q's from the 84 end. The 
P column is most quickly formed as the differences of successive 
numbers of the product column. The P's rise to a maximum and then 
sink again. They represent in fact the cocked hat distribution of the 
just perceptibly heavier points, and should be compared with the actual 
distribution of the latter. The values P really give the distribution which 
would be found were an infinite number of ascents to be made, the 
•probabilities of the answer heavier at the different points remaining through- 
out constant at the actual frequencies which are found in these 400 ascents. 

It will be noticed that the P's do not add up to unity unless the 
amount 0-0025 is included. This quantity gives the probabiUty that 
an ascent should be made without obtaining any answer heavier. In the 
example these "just perceptibly greater" points are centred at the 
weight 112, since it is necessary to make some assumption as to their 
position. Strangely enough, Professor Urban himself omits them in his 
calculations, causing in some cases quite an appreciable error. This 
dif&culty about the "taU" of the P distribution would not have been 
necessary had the experiments been extended to a point where all the 
answers were heavier. The pecuhar difficulty about this is however that 
the psychological conditions would thereby be changed*. The "tail" 
difficulty will therefore always be present in threshold measurements. 
It causes mathematical troubles in all the methods except only in the 
process of calculation used in the Constant Method, and will be fre- 
quently referred to in this and the succeeding chapters. 

If the steps between the stimuli are equal, as here, a saving in 
arithmetic which escaped Professor Urban's notice can be effected by 
using the summation method described on page 19. The sum of the 
q products gives the distance of the threshold from the end stimulus. 
If the equal increments between the stimuli are not unity but x, the 
sum of the q products must first be multiplied by x. The standard 
deviation can also be obtained by a further application of the summation 
process; the details are left to the reader who should consult the next 
chapter. The consideration of this point here might lead us too far 
from the main argument. 

* See inter aUa the article "On Judgments of Like," Frank Angell, Am. Joum. 
Psychol. 1907, xvin. p. 253. 
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In the same experiment the points of just not 'perceptible difference 
were distributed as follows: 



Grams 
Frequency 



84 




88 
8 



92 
36 



96 
99 



100 
200 



104 
35 



108 
22 



400 in all. 



Of these the mean is 98-84 grams, and the standard deviation 
4-02 grams. The application of Urban's Formula in- this case gives the 
following results : 
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The mean threshold is therefore 



by the direct method 
by Urban's Formula 



(100^36 + 98^84)/2=99^60 grams, 
(10004 +99-14)/2 =99^59 grama. 



Professor Urban's own calculated values are sUghtly different, partly 
because he neglects the tail of the distribution as above explained, 
partly because of an arithmetical error. 

The practical applications of Urban's Formula are few, though it is 
occasionally useful in cases where the direct observation of the just 
perceptible points is impracticable*. Its great value lies in the con- 
clusions which it enables us to draw. In the first place it shows clearly 
that the order in which the stimuli occur has no effect on the value of 
the threshold from a mathematical point of view, for the sum of the 
quantities sP is independent of their order, and the P's themselves are 
equally independent thereof. This does not of course mean that a change 
of order of the stimuli will have no psychological effect. There is no 
doubt that the effect of a non-consecutive series of stimuli will be very 



* See e.g. G. H. Thomson, "Changes in the Spatial Threshold during a Sitting," 
Brit. Journ. Psychol. 1914, vi. p. 438. 
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different from that of an ascending or descending series, inasmuch as 
the latter will arouse feelings of expectation and the like. Professor 
Urban therefore recommends that except when it is specially desired to 
study the results of ascents or descents the stimuli should be arranged 
in non-consecutive order. In fact it will be seen that the above described 
simple mathematical process can also be appUed to data collected by the 
Method of Right and Wrong Cases. We have here an example of the 
distinction between methods of collecting data on the one hand and 
processes of calculation on the other. The Limiting Process of calcula- 
tion can be appUed to the Constant Method of collecting data. 

Professor Urban however, although recommending non-consecutive 
stimuli, does not recommend that the data should be collected in the 
way usual in the Constant Method if it is intended from the outset to 
apply the Limiting Process. On the contrary, instead of keeping the 
stimuli constant throughout the experiment, he advises a frequent 
change of stimulus-series. For he found, from consideration of the 
results of interpolating and of irregularly spacing stimuli, that the final 
values obtained by the Limiting Process are dependent upon the par- 
ticular stimuli used, in a mathematical way. To free results from this 
bias as far as possible, he recommends the following procedure. The 
steps in any one series should be of equal size, and the starting point 
of the steps should be altered from series to series until the whole area 
has been covered. For example, if the spatial threshold were being 
investigated, the stimuU used might be* 

Krst series 0, 1, 2, 3, 4, 5, 6, 7, 8 centimetres; 
second series 0-1, M, 2.1, 3-1, 41, 51, 61, 71, 81; 
third „ 0-2, 1-2, 2-2, 3-2, 4-2, 6-2, 6-2, 7-2, 8-2; 



tenth „ 0-9, 1-9, 2-9, 3-9, 4-9, 5-9, 6-9, 7-9, 8-9. 

The series might of course follow each other in any order. When this 
is done. Professor Urban shows that the final Hmen arrived at is the 
point where the probabihty of the response in question is one-half. 
That is, the Limiting Method finds the median threshold^. 

A modification of the Method of Limits, which effects a more com- 
plete elimination of the expectation error by the use of "catch experi- 
ments," where the variable given is equal to the standard, and which 

* See 6. H. Thomson, "Changes in the Spatial Threshold during a Sitting," Brit. 
Journ. Psychol. 1914, vi. pp. 436 — 6, where this plan is followed. 

t For other discussions of the mathematics of the Limiting Process and allied subjects 
the reader may consult articles by Wirth, Urban, Lipps in the Archivf. d. gee. Psychologic, 
etc. 
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also collects data at a quicker rate than the ordinary Method of Limits, 
is that known as the Method of. Serial Grov/ps*. 

Each fixed value of the variable stimulus is presented with the 
standard ten times, not in immediate succession, but interspersed at 
random among ten other values of V equal to S. The percentage of 
correct answers given by the subject is noted, and the experimenter 
passes on to the next value of F which is presented along with catch 
stimuli in a similar manner. The value of F which, as presented in this 
way, gives 80 % right answers, is arbitrarily chosen as measuring the 
limen. Of course this choice of 80 % makes the method measure a 
totally different point from the 50 % point measured by the Method of 
Limits, but this is not essential to the method as a method of collecting 
dataf. As such it is a very convenient one to use in measuring a large 
number of subjects in "mental test" experiments, or with primitive 
people, where economisation of time is essential. 

The mathematical theory of this method is similar to that of the 
Method of LimitsJ. It has been shown that mathematically the Method 
of Limits is superior, and that the mathematical disadvantages increase 
with the size of group taken. For this reason groups of four have been 
suggested instead of ten §. Further, the arbitrary use of the 80 % point 
obscures comparison with the results of other methods. The 50 % point 
would be better although it does not allow so large a number of subjects 
to be measured in a short time as does the 80 % point, if, as is assumed, 
each descent is stopped as soon as the required point is reached, and 
a new descent begun. With groups of four, the 75 %, 50 % and 
25 % points can all be noted if time permits of complete descents 
and ascents (thus giving both the median limen and a measure of 
scatter, the interquartile range), while if time pressed, the 75 % point 
would be sufficiently comparable with the 80 % point of previous 
experiments. 

As in the Method of Limits, the mathematical foundations are 
unaltered if the groups cease to be in serial order. We thus arrive at 

* See Text Book of Experimental Psychology, C. S. Myers, Cambridge, 1911, p. 196. 

t See G. M. Stratton (Psychol. Review, 1902, ix. pp. 444 — i47), who says "a detaU 
like this, as well as the exact number of experiments that may beat form a group, might 
well be considered as subject to revision in the light of further experience and not as an 
essential part of the method." 

X G. H. Thomson, "A Comparison of Psychophysical Methods," Brit. Joum. Psychol. 

1912, V. p. 212. 

§ "An Inquiry into the Best Form of the Method of Serial Groups," Brit. Joum. 
Psychol. 1913, v. pp. 398 — 416; and "The Probable Error of Urban's Formula," ibid. 

1913, VI. pp. 217—222. 
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a Method of Non-Consecutive Groufs*, which has been held by one 
experimenter to be the best method of collecting data. 

It may be pointed out in passing that the Method of Groups is one 
which is naturally employed in other branches of mental measurement 
as well as in psychophysics. For example, the widely known Binet Tests 
are given in groups beginning at a group designed to suit a child of very 
young age, and are proceeded with until a certain percentage of passes 
is obtained at some group. Usually however modifications in the form 
of marks for tests passed above the critical group are admitted. There 
is no doubt but that the mathematical foundations of many of these 
devices require examination in the light of the theory of probabilityf . 

It may also be pointed out that as methods of collecting data the 
Method of Serial Groups, and that of Non-Consecutive Groups, especially 
the latter, approach the principle of the constant method yet to be 
described, and indeed their data can very well be handled by the pro- 
cesses in use in the latter method. 

(3) THE METHOD OP AVERAGE ERROR 

In this method the subject is required to adjust a variable stimulus 
so that it seems subjectively equal to a given standard stimulus. In this 
it differs considerably from the other methods in which the experimenter 
does the adjustment, usually out of sight or even before the sitting 
starts, and the subject expresses an opinion on each stimulus but does 
not alter it. The alternative name of the present method, viz. the Method 
of Production, suitably emphasises this important psychological point. 

Mathematically the method presents differences which are mainly 
due to the fact that in this method almost any value of the variable 
stimulus can crop up, whereas in the other methods the experimenter 
customarily keeps to certain steps, so that the results are as it were 
heaped up at certain points. 

The experiment is repeated a large number of times — at least 100 — 
and the arithmetical mean of all the obtained stimulus- values is calcu- 
lated. The difference between this mean value and the standard stimulus 
is known as the crude constant error e. It may be either positive or 
negative. A measure of scatter is also found. 

The crude constant error may be partly due to a space error (the 
time error cannot occur in this method), partly to other constant 

* Thomson, Brit. Journ. Psychol. 1912, v. p. 205. See also Brit. Journ. Psychol. 1914, 
VI. p. 434, where this method was employed. 

t See Francis N. Maxfield, "Some Mathematical Aspects of the Binet-Simon Tests," 
Journal of Educational Psychology, 1918, ix. pp. 1 — 12. 



CH. m] THE PSYCHOPHYSICAL METHODS 57 

conditions*. Let us assume that the experiment is to adjust the length 
of a variable line until it seems to be equal in length to a standard line. 
In this case, the standard should be situated to the right in one-half 
the number of adjustments, and to the left in the other half, either 
alternately or in haphazard order. The results are tabulated in two 
columns, I standard to the right, II standard to the left, and the means 
of these two found separately. Half the difference of these means is 
the space error, while half their sum, less the standard, gives the residual 
constant error. 

In order to give as much definiteness as possible to the task' of 
adjustment, the variable should start sometimes shorter than the 
standard, sometimes (an equal number) longer, and the adjustment be 
made by lengthening or shortening respectively. Again, the requisite 
amount of shortening or lengthening should be arranged to be different 
on different occasions, but alternating with some degree of regularity. 

The value of the scatter obtained in this method is from a general 
point of view a more important result than the value of the constant 
errors, since it has often been regarded as proportional to the value of 
the difference threshold as determined by the other two psychophysical 
methods. The truth is that although there is a certain amount of 
proportionality between the values, this proportionality is not complete. 
Under certain conditions the two values vary in opposite directionsf. 
It is hardly necessary to point out that the scatter of the individual points 
obtained in the Method of Limits has but little relation to that reached 
by the Average Error Method. They are not entirely unrelated, however. 

The closest correspondence of any is that between the distribution 
of the errors in the present method and that of the judgments "equal" 
in the Constant Method. 

(4) THE CONSTANT METHOD 

This is generally regarded as the most satisfactory of the psycho- 
physical methods. It can be employed with equal convenience for the 
determination of absolute thresholds, difference thresholds, equal- 
appearing sense-distances, and other measurements of psychological 
importance. The different values of the variable stimulus to be employed 
are fixed once for all at the beginning of the investigation, and are 
presented to the subject a large number of times (say 100 applications 

* Cf. E. B. Titchener, "Experimental Psychology," Student's Manual, n. p. 74: "A 
constant error is simply an error whose conditions are constant; its amount may vary, 
quite considerably, from stage to stage of a Jong series of experiments." 

t This question is coimected with the point discussed on p. 75 et seq. 
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of each) in irregular order, or in a prearranged order, unknown to the 
subject, corresponding to certain precautions. If an absolute threshold 
is being determined, the variable is presented alone; if a difference 
threshold, it is on each occasion preceded, accompanied, or followed by 
the standard. In the latter case, the subject returns the replies greater, 
uncertain or equal, less, with reference either to the standard, or to the 
variable, or to the first presented stimulus, or on some other prearranged 
plan. The percentage of each of these three types of answers is deter- 
mined for each value of the variable used, and recorded (it is also 
advisable to retain the raw data in the form of the actual answers in 
the order given). 

As one illustration of this method, we shall find it convenient to 
refer to a series of results obtained by Eiecker (Zeitschrift f. Biologie, 
Bd. X.) which has already served in the descriptions of the method 
given by Miiller, Titchener, and Myers. Eiecker obtained the following 
results in an investigation of the "spatial threshold," or threshold of 
two-point judgments of the skin of the lower eyelid: 

s (distance between the points of 

the aestheaiometer in Paris 

lines)* 
lOOp (% of two-point judgments 

It will be observed that, with two exceptions, the series of per- 
centages follows a general law of increase, the rate of increase itself 
increasing at first and then diminishing. The numbers in the lowest 
line are formed by subtracting each percentage from the immediately 
succeeding one, and show this uniformity more clearly. The two excep- 
tions are at s = and s = 5. At s = the percentage is greater than at 
the immediately succeeding stimulus. This irregularity is known as an 
inversion of the first order. The cause of it is doubtless to be looked for 
in the exceptional way in which the stimulus (one point) may have been 
applied, the pressure and general nature of the contact may have been 
different from what they were with two-point contact, or some mis- 
leading suggestion may have accompanied these particular experiments. 

At s = 5, the percentage is indeed larger than that for s = 4 and 
smaller than that for s = 6, but reference to the differences shows that 
the increase from 4 to 5 is greater than the increase from 3 to 4, thus 
breaking the general rule as regards rate of increase. This irregularity 
is known as an inversion of the second order, and being in the present 

* A Paris line =2-25 mms. 
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case slight, is probably to be explained as due to an insufficiency in the 

number of applications of the stimulus. 

As a second example, in this case one dealing with a difference 

threshold, we shall take the data for lifted weights which have already 

been used on p. 51 in connection with the Method of Limits. The 

standard weight was 100 grams and was lifted before each of the seven 

comparison weights. The judgnients given were lighter than, equal to, 

or heavier than the standard. With Subject I the answers heavier were 

distributed as follows, in 450 trials with each weight : 

Comparison weight s • 84 88 92 96 100 104 108 grams 

Answers heavier 1 9 40 100 186 403 423 

Proportions f) -0022 -0200 -0889 -2222 -4133 -8956 -9400 

The third row of this table is simply the second row divided by 450*. 
A figure of the quantities f forms a curve which, following Galton, we 
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Fig. 7. Urban's Subject I. j) = proportion of answers icomer 

may describe as an ogive. It is shown in Fig. 7, where however no 
attempt is made to draw a smooth curve through the points, which are 
merely joined by straight lines. 

Here there are no inversions of either order, and the example is 
therefore a more convenient one for a first explanation of the various 
processes of calculation which may be adopted. We shall for this reason 
deal with it first, returning later and considering Riecker's more irregular 
numbers. 

We must commence by defining the limen, and do so in the first 

* This figure is correct. But (as stated on page 51) Urban only gives 400 values of 
the just perceptibly heavier points: we are unaware of the reason which led him to discard 
the other 50. 
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place as the point corresponding to 50 % of the judgments in question. 
For example, in our case we define the limen for the response heavier 
as being the point where the subject would give half his answers heavier, 
the other haK being lighter, equal, or whatever other answer is allowed, 
but not heavier. Above this point he is more likely to answer heavier 
than not, below it he is more likely not to give this reply. 

The hmen corresponding to 50 % heavier judgments may be calcu- 
lated in two general ways: (A) from the observed values, (B) by finding 
the best fitting smooth curve, adjusting the observations to this, and 
then calculating the constants (mean, mode, scatter, etc.) from the curve. 

A (1). Linear Interpolation 

The required limen obviously falls somewhere between 100 grams 
(41-33 % answers heavier) and 104 grams (89-56 % answers heavier). 
A very simple way therefore of determining its value is to assume these 
points joined by a straight line, as in Fig. 7, and find where this line 
cuts the 50 % line. Arithmetically this means finding a weight which 
divides the interval from 100 to 104 grams in the same proportion as 
50 % divides the interval between 41-33 % and 89-56 %. If T be this 
weight, we have then 

r - 100 _ 50 - 41-33 

lOi- T~ 89-56 - 50 ' 
and with practically no calculation we obtain the value 100-72 grams 
for the limen. 

This method though commendably simple is open to several ob- 
jections: 

(1) It does not employ all the data ; it uses two of the percentages 
only. 

(2) The assumption that the curve is a straight line at this point 
is unlikely to be exact. 

(3) It gives no measure of scatter. 

A very simple extension of the above idea has been suggested* 

which obviates the third and to some extent the first of these objections. 

This is to calculate the 75 % and the 25 % points by the same simple 

form of linear interpolation as that employed for the limen itself, that 

is, on the Fig. 7, to find where the zig-zag ogive crosses the -25 and 

•75 lines. These distances are readily found by a short calculation 

to be 96-58 and 102-79 grams respectively. Half the interval between 

* G. H. Thomson, "A Comparison of Psychophysical Methods,'' Brit. Journ. Psychol. 
1912, V. p. 210 footnote. 
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them, namely 3-10 grams, is the semi-interquartile range, a rough but 
very practical measure of scatter. 

The fact that the 50 % point is not half-way between the 25 % point 
and the 75 % point is a rough indication of the skewness of the data : 
it divides the interquartile range in the proportion 4-14 grams below 
and 2-07 grams above. 

Moreover, it will be found that in practice the mean of the three 
values, the 25 %, 50 %, and 75 % points, gives a fairly good approxi- 
mation to the threshold found by more comphcated calculations assuming 
a symmetrical distribution. In the present case this gives 

96-58 + 100-72 + 102-79 , . . . „ 

= 100-03 grams. 

o 

A (2). The Arithmetical Mean* {Spearman's Formula) 

A value which can be calculated by a use of all the data is that of 
the mean or average limen. Before proceeding to consider this, it is 
important to reaUse clearly the fact, which G. E. Miillerf was the first 
to point out and emphasise, that a limen is a variable magnitude 
foUowiag a law of frequency-distribution. There is no fixed limen, only 
an average limen, a most frequent Umen, or a most representative limen. 
The p's in the table (p. 59) represent the relative frequency of limina 
for stimuli below the corresponding s. Thus there were 41-33 % hmina 
below 100 grams, and 22-22 % limina below 96 grams; i.e. there were 
41-33 — 22-22 or 19-11 % limina between the limits of stimulus-values 
96 grams and 100 grams. This suggests the plan of plotting a frequency- 
polygon or histogram for the limina using these differences of the -p's. 
For the present case the table of differences runs : 
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* "The Method of Right and Wrong Caaes without Gauss's Formulae," Brit. Journ. 
Psychol. 1908, n. pp. 227—242. 

t This at least is the view to which Miiller himself inclines {Gesichtspunkte, p. 59), 
but he deduces his formulae on the assumption, supported by Fechner and Bruns, that 
the threshold has a single definite value, subject in the course of an experimental deter- 
mination to variable apparent increase or decrease by random influences which obey the 
Normal Law of error. He points out that both views lead to the same formulae. Indeed 
the distinction ia merely a verbal one, in our opinion. 

1 This is not necessarily an inversion of the second order, for it may be spread out 
to any distance above 108 grams. 
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and the histogram, or pseudo-histogram* as it is safer to call it, is given 
in Fig. 8. 

If we now wish to proceed to the calculation of the mean of these 
limina we must decide where to centre those which He between say 
92 and 96 grams. It is well known that this centre must be taken in 
the actual geometrical centre of each intervalf , and Professor Spearman's 
suggestion to use not the mid-ordinate, but an ordinate shifted slightly 
towards the higher of the two neighbouring columns, is unnecessary. 
The unadjusted value is the best value of the mean. The source of 
Professor Spearman's error hes in the fact that in his discussion he 
replaces each rectangle of the pseudo-histogram by a trapezium of 



200 



100 



j: 



31 



60 



86 



217 



84 88 92 96 100 104 108 grams 

Fig. 8. Pseudo-hiBtogram of the 450 limina. Urban's Subject I, answers heavier 



different area, and then multiplies the centroid abscissa of this new area 
by the old area, which could not possibly have the centroid of the new 
one. 

A real difficulty, and one which robs the method of much of the 
practical utihty which it would otherwise have, is the impossibility of 
fixing in any unambiguous fashion the centres of the tails of the distribution, 
that is those limina which in our example lie below 84 and above 108 
grams respectively. This is the same trouble as that referred to on 
p. 52 in connection with the application of Urban's Formula. 

* See later, p. 79. 

t Cf. e.g. W. I". Sheppard, "On the Calculation of the most probable Values of Fre- 
quency Constants," Proc. Land. Math. Soc. xxix. pp. 353 fE. 
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The simplest thing to do is to assume that at 80 grams the subject 
would never, and at 112 grams would always, have answered heavier, 
i.e. to centre the tails at 82 and at 110 grams respectively. We shall 
do so in this case, although it is clear that at the upper end the tail is 
probably more spread out than this. We then have the following calcu- 
lation for the mean or average limen : 



1 at 


82= 82 


8 „ 


86= 688 


31 „ 


90= 2790 


60 „ 


94= 5640 


86 „ 


98= 8428 


217 „ 


102=22134 


20 „ 


106= 2120 


27 „ 


110= 2970 




44852 total 



Dividing this by the whole number of limina, 450, we obtain 99-67 grams 
as the threshold by this method. The standard deviation can also be 
found, cr = 5-1 grams (but see next chapter for a correction to this 
value, known as Sheppard's adjustment). 

The summation method of finding the mean, which is explained on 
p. 19, again enables a saving in arithmetic to be effected in this calcula- 
tion. Applied directly to the frequencies p it works as follows in the 
present case, and obviates the actual formation of the pseudo-histogram. 



Freijuenoy of answers heavier at 84 grama 


•0022 


88 „ 


•0200 


92 „ 


•0889 


96 „ 


•2222 


100 „ 


•4133 


104 „ 


•8966 


108 „ 


•9400 




2-6822 aum 




4 gram unita 



Thia muat be subtracted from origin 



10-3288 
110 



1-6712 grama threshold. 



The reader must be referred to a study of the summation method on 
p. 19 to realise why in this case the sum obtained has to be subtracted 
from the origin. 

We can at this point call attention to an instructive comparison 
which can be drawn between the Method of Limits and the Method of 
Right and Wrong Cases, as regards the processes of calculation employed 
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in them, on the basis of Urban's Formula for the former and Spearman's 
Formula* for the latterf. 

Urban's Formula, S (sP), shows the threshold as the mean of a 
number of just perceptible points, which are centred at the stimuli used, 
the quantities P being the differences of successive products of the 
frequencies of the answers heavier (or not heavier). 

Spearman's Formula, as exemplified in the above calculations, can 
be written S{s.dp), where dp is the difference of the successive fre- 
quencies p, and shows the threshold as the mean of a number of hmina 
which he between the stimuU used. 

B. Methods which fit smooth curves to the data 

The idea underlying these methods is to run a smooth curve through 
the points of Fig. 7 (the f values) instead of simply joining them by 
a zigzag, and then to ascertain where this smooth curve, which does 
not necessarily go exactly through any of the points but smooths off 
any inequahties, passes the 50 % line. Any suitable curve which 
happened to occur to one might of course be employed. For example, 
a parabola of high order can be usedj, and the curve tan~^ 6 has also 
been tried §. But clearly the whole experiment suggests that an error 
function of some sort is wanted, and as early as 1860 G. T. Fechner 
suggested II that such numbers formed the integral of a Normal Curve 
of Error. 

This idea would naturally occur to anyone accustomed to handling 
the Normal Curve on considering the pseudo-histogram or table of 
differences. Fig. 8. The obvious skewness of the diagram would also 
strike such an observer, it may be noted in passing, but with this point 
we shall deal in a separate chapter. 

The obvious way to fit such a histogram with a Normal Curve is 
that given in the previous chapter, namely, to find the mean and the 
standard deviation, and use these constants in the expression for the 
curve. To this there are however important practical objections, the 
chief being the difficulty of the undefined tails, to which reference has 

* We use Professor Spearman's name for this plan since he was the first English 
writer, we believe, to call attention to it: but we do not include the erroneous point in 
his calculation to which reference has already been made. 

t G. H. Thomson, "A Comparison of Psychophysical Methods," Brit. Jonrn. Psychol. 
1912, V. p. 226. 

% F. M. Urban, "Die psychophysischen Massmethoden als Grundlagen empirisoher 
MesBungen,'" Archiv f. d. ges. Psychologie, 1909, xv. and xvi. pp. 335 — 355. 

§ Urban, he. cit. p. 393 et seq. 

II 6. T. Fechner, Elemente der Psychophyaih, 1860. 
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already been made in discussing Spearman's plan of finding the mean. 
This difficulty is still more acute when we attempt to find the standard 
deviation. This tail difficulty does not occur in the plan adopted by 
Miiller*, which fits the Normal Integral direct to the f values, not 
troubling at all about the differences forming the histogram. In other 
words, Miiller's process fits a curve to Fig. 7, not to Fig. 8, and is 
therefore more direct, in addition to the advantage it has of avoiding 
the tail problem. 

Before proceeding to the explanation of Miiller's process, it is 
necessary to notice that he employed a slightly different form of the 
equation to a Normal Curve from that used in the previous chapter. 
The latter is the form now in general use among biometricians, and it 
seems desirable that it should also be used by psychometricians, who 
otherwise would be hindered from the direct appUcation to their work 
of the mathematical improvements made by the biometric school, and 
especially would find their use of the valuable tables published by 
Professor Pearson much hampered. In the actual description of Miiller's 
work however it is better for the present to keep to his notation in this 
respect. The form of the Normal Curve used by him was 

In this s is the variable stimulus, T is the average lunen, and therefore 
also, since the curve is symmetrical, the median and the mode. So far 
the notation agrees with that used in the expression employed for the 
Normal Curve in the previous chapter, namely 

but a further comparison of the two shows that for his second constant 
Miiller has a quantity h, which is connected with the standard devia- 
tion a of the biometric formula by the relationship 

When the standard deviation is large, therefore, h is small. It is a 
measure of precision, not of scatter. 

The assumption is now made that the relationship between the 

* 6. E. Miiller, "Ueber die Maassbestimmungen des Ortssinnes der Haut mittels dei 
Methode der richtigen und falsohen Falle," Pfluger's Archiv fur die ges. Physiologic, 1879, 
XIX. pp. 191 — 235, especially par. 5 et seg.: also Die Oesichtspunkte und die Tatsachen 
der paychophysiachem Mefhodik, Wiesbaden, 1904, par. 11, where the classical description 
of this method will be found. 

B. & T. 5 
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stimulus s, and the frequency p with which the answer heavier is returned, 
is given by the equation 

^_r'' *_e-''>-3')"cfe = (1). 

J -00 V (■"") 

That is to say, it is assumed that the successively increasing percentages 
of answers heavier correspond to the increasing area of the portion of 
a Normal Curve which is shaded in Fig. 9, as the point s moves to 
the right in that figure. 

To obtain this equation in a more convenient form for our purpose, 

^^*' ' h{s-T) = t - V (2). 

This corresponds to measuring the stimuh in a special unit, and is the 




T 

Fig. 9. To illustrate Muller's proceBS 

same device as that used in another connection later by Galton. The 

equation then becomes 

1 rh(s-T) 

^-VR)L ^-'''-' (^)- 

By inserting in this equation the corresponding values of p and s 
from the table of data on p. 59, we obtain seven equations for two 
unknowns h and T, and as these equations are shghtly inconsistent with 
one another we have to decide how to calculate the most probable 
values of h and T. No pair of values will exactly satisfy all seven 
equations. Instead of coming to zero they leave small residuals v. 
Miiller adopted the Method of Least Squares, an account of which will 
be found on p. 44 in Chapter II. 

In passing a note must be made of the fact that Miiller assumed 
tacitly that these observation equations, being each based on the same 
number of experiments, are of equal importance or " weight*." We shall 
allow this assumption to pass for the present but shall return to it later. 

Unfortunately, the equations are very far from being simple and 

* There is unfortunately a, possibility of ambiguity here owing to the fact that 
weights are used as stimuli. 
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linear as in the example on pages 44 and 45. To avoid this difficulty, 
we look up in tables of the Probability Integral* those values of 

Y = h{s-T) (4) 

which correspond exactly to our values of f. 

These equations are not yet Unear in T and h, though much simpler. 
If however we write c — hT 

they become y — As + c = (5)-j- 

and are now linear in h and c. If we now insert any pair of values 
A and c into these seven (or in general, m) equations, these also will 
leave residuals u, different from those considered above in connection 
with the equations (3). If we were now to proceed to make S (m^) 
a minimum, this would not effect our purpose. It is S {v^) we wish to 
make a minimum, not S (m^). If however we can find multiphers or 
"weights" M such that each 

Mu^ = v^, 
then we can make 8 (Mu^) 

a minimum. That is, we can apply Least Squares to the equations (5) 
weighted with certain artificial weights (in addition to any weights 
which may possibly be necessary by reason of there being different 
numbers of experiments at the different stimulus-values). The use of 
this device of artificial weights to overcome the complexity due to the 
non-hnear equations is Miiller's particular credit in this connection. 

Clearly the residuals v, which may be regarded as errors in p, are 
connected with the residuals u, which may be regarded as errors in y, 
by the equation j 

from equations (3) and (4). Therefore 

M = e-^y'/TT. 

Herein we can omit the it since it is only the relative values of the 
Miiller weights which are of importance. These weights are, by reason 
of the improved weights to be shortly described, of only historical 
interest. The condition that S (v^) should be a minimum has now 

* The table of the Probability Integral commonly used by psychologists is that known 
as Fechner's Fundamental Table, which is given in Appendix I. It is desirable that 
psychologists should make the slight changes in notation necessary to enable them to 
use better and more generally accessible tables. For example, the first table in Pearson's 
Tables for Biometricians and Statisticians is, except for a factor ■\/2, identical in significance 
with Fechner's, and gives more values, to more decimal places. 

f Note that this, Uke (3), represents a set of equations, each of this form. In our 
example there are seven such equations. 

5—2 
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become that S (Mu^) should be a minimum. With this substitution, 
the equations (5) give the two Normal Equations 

S (Msy) - S (Ms^) h + S (Ms) c = [ 
S {My) + S (Ms) h-S{M)c = 
Thence we have 



•(6). 



S (Ms) S (Msy) - S (My) S (Ms^) ^ 
'^~ S{M)S (Ms^) - S (Ms)^ 



.(7). 



, S{M)S (Msy) - S (Ms) S (My) 

S(M)S (Ms^) - S (Ms)^ 
T = c/h 

The use of these formulae is best explained by an example. Before 
giving one, however, it is well to describe a modification in the weights M 
which was introduced in 1909 by Professor F. M. Urban*, as it is with 
Urban's weights, and not with Muller's, that we shall actually work. 

These alterations in the Miiller weights, or rather additions to 
Muller's weights, which Professor Urban made, arise from the notion 
of the probability of a certain judgment, with which we are already 
familiar. The analogy is, as it were, between extracting the answers 
heavier or lighter from a subject, and extracting black or white balls 
from a bag containing a mixture of these colours. Compare, for example, 
the two statements: 

(1) From a bag containing black balls and white balls, 450 drawings 
are made, one at a time, the ball being returned each time before the 
next drawing is made. 403 black balls are observed out of the 450. 

(2) A subject on performing a certain experiment with lifted weights 
sometimes gives the answer heavier, sometimes some other answer. On 
one occasion, when the weights were 100 grams standard and 104 grams 
unknown, this experiment was repeated 450 times, and the answer 
heavier was obtained 403 times out of the 450. 

Now if p is the observed proportion (here 403/450) of black balls in 
a bag, then the probable error of p is known to vary with Vp (1 — p), 
or Vpqii. With the same sized sample, a result p= -6 has a larger 
probable error than a result p = -8 say. If anything similar holds, as the 
analogy suggests, for the psychometric experiment, then the seven, or n, 

* "Die psychophysisoheu Massmethoden," Archivf. d. gea. Psychol. 1909, XV. and xvi. 
p. 357 et seq. 

•j- Cf. p. 32, Standard Deviation of the Binomial Expansion. Really the true values 
of p and q should be used, but this is the best we can do. And fuither, the expression 
probable error ceases to have an accurate meaning when p is too close to zero or unity 
and the distribution is in consequence very skew. But these refinements do not affect 
the argument except in detail. 
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equations (5) are not equally reliable, even though based on the same 
number, 450, of experiments each. In addition to the Muller weights M 
they need other weights 'i-jiTpq to allow for this new variation in reliability. 

These weights, it will be observed, arise from the fact that drawing 
balls singly from a bag in this way gives rise to a binomial distribution, 
and the standard deviation of such a distribution is, as is shown on 
p. 32 in the previous chapter, equal to Vfq. The combined weights 
M/ip^ are known as Urban's weights, and are also given in a table in 
Appendix I. Professor Urban discusses the matter at some length in 
his already cited article, and a discussion will also be found in Wirth's 
Psychofhysik (Leipzig, 1912), where on p. 151 the actual scatter of 
various p's is given in a diagram. 

The fact that in the above we have taken the weights as proportional 
inversely to the square of the probable error, fq, need cause the reader 
no trouble, for it is the same phenomenon as the fact that the accuracy 
of a set of readings increases with the square root of the number of 
readings made, as follows from p. 24. The weight of an observation 
equation in an ordinary sense is simply the number of times the observa- 
tion has been repeated, that is it is proportional to n the number of 
observations, which as we have just said is proportional inversely to 
the square of the probable error. 

Using the symbol W for the combined Urban and Muller weights 
which are given in tabular form in Appendix I, we have to replace M 
by W in the equations (6). These equations we shall now illustrate 
by giving at some length the calculations indicated by them in the case 
of Urban's Subject I, answers heavier. 

We first look up in Pechner's Fundamental Table*, i.e. a table of the 
probability integral, those values of y which correspond to the observed 
values of p. These are given in the third column of the adjoining table. 
The fourth column of that table gives the values of the Urban-Miiller 
weights W, found from the table in the Appendix. 

Urban's Subject I, Heavier answers 
Values of y and W for substitution in the normal eC[uation3 



las s grams 


P 


7 


W 


Wi 


84 


-0022 


-20150 


0-025 


2-10 


88 


•0200 


-1-4520 


0-187 


16-46 


92 


-0889 


-0-9528 


0-502 


46-18 


96 


■2222 


-0-5408 


0-806 


77-38 


100 


•4133 


-01549 


0-982 


98-20 


104 


•8956 


0-8888 


0-551 


57-30 


108 


•9400 


1-0993 


0-396 
3-449 


42-77 
340-39 






* See Appendix I. 
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From this table S (W) is at once obtained, and the other quantities 
appearing in equations (6) are easily formed, though the arithmetic 
is laborious. The work for S (Ws) is given in the last column of the above 
table, and the other quantities are obtained similarly. They prove to be 

S(W)= 3-449, 

S{Ws)= 340-39, 

SiWsy)= -31-223, 

S {Wy) = - 0-463, 

^[(1;Fs2) = 33700-1. 

The equations (7), reading W instead of M, then give 

Threshold T = 99-68 grams. 

Precision h = 0-136113. 

The standard deviation corresponding to this precision is 

'' = V^ = ^-^^^'^^' 

and •6745ct = 3-5 grams. 

The fact that 99-68, which is thus found for the threshold of answers 
heavier, is actually smaller than the standard 100 grams than which it 
is judged heavier, may cause confusion if it is not at once explained 
that this value involves a time error. 

It will be seen that the Constant Process as described above and as 
illustrated by this example, involves a great deal of arithmetical work, 
so much so indeed that it is certain never to be used except in some 
special cases, unless plans for easing this labour be adopted. Something 
can be done by using the arithmetical short-cuts explained in the pre- 
ceding chapter, and Crelle's Calculating Tables, or better still a calcu- 
lating machine, makes the work practicable. But the best device for 
reducing the arithmetical work involved in the Constant Process is that 
adopted by Professor Urban in pubHshing his tables for this method. 
These tables are given in Appendix I, and do away with the necessity 
for Fechner's Fundamental Table and the Table of the Mtiller-Urban 
weights. They assume that exactly 100 experiments have been made 
at each stimulus-value, but of course if other numbers of experiments 
have been made, the p's can be approximated to by two significant 
figures. Their use will be readily grasped from the worked example on 
p. 73 below: and that example is also employed in Appendix I to 
explain Eich's useful Checking Table. 
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We give next, as models, the calculations by all the above methods 
for Eiecker's data, assuming however that no two-point judgments were 
given at zero stimulus-distance. The data are given on p. 58 and it 
should be noted that the stimuli are not equidistant, and that therefore 
summation methods cannot be used to lessen the arithmetical work. 

Riecker's Data 

(1) Limiting Process. By Urban's Formula 
(a) Just Perceftibly-two Points 



s 


P 


? 


q products 


P 


Ps 


Ps' 





•00 


100 


1-0000 


■0000 


•0000 


■0000 


0-5 


•10 


•90 


■9000 


■1000 


■0500 


■0250 


1 


•14 


•86 


•7740 


■1260 


■1260 


■1260 


1-5 


■40 


•60 


•4644 


•3096 


■4644 


■6966 


2 


•65 


•35 


■1625 


•3019 


■6038 


b2076 


3 


•80 


■20 


■0325 


•1300 


■3900 


M700 


4 


•87 


•13 


■0042 


•0283 


■1132 


■4528 


5 


•96 


•04 


■0002 


•0040 


■0200 


■1000 


6 


1-00 


•00 


■0000 


•0002 


■0012 


■0072 


Sums 


l^OOOO 


1^7686 


3^7852 



T = sum of Ps = 1-7686 Paris Lines. 
Standard deviation of the just perceptibly-two points, squared, 
= sum of Ps^, less T^ 
= 3-7852 - 3-1279 = -6573, 
whence standard deviation = -81 Paris Lines. 

(6) Just Imperceptibly-two Points 



a 


P 


p products 


P' 


P'a 


PV 





■00 


■0000 


■0024 


■0000 


■0000 


0-5 


■10 


■0024 


■0219 


■0109 


■0055 


1 


■14 


■0243 


■1494 


■1494 


•1494 


1^5 


■40 


•1737 


■2606 


■3909 


•5863 


2 


■65 


■4343 


■2339 


■4678 


•9366 


3 


■80 


•6682 


■1670 


■5010 


1^5030 


4 


■87 


■8352 


■1248 


■4992 


1^9968 


5 


■96 


■9600 


■0400 


■2000 


1-0000 


6 


100 


1^0000 


■0000 


■0000 


■0000 


Sums 


1^0000 


2^2192 


6^1766 



whence 



T' = 2^2192 Paris Lines, 
ct' = 1-12 Paris Lines, 
(T + T')/2 = 1-99 Paris Lines, 
Mean of the two a's, 0-97 Paris Lines. 
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(2) Linear Interpolation 
for 75 %, 50 % and 25 % points. 
80-75_3-(?, ^^^^.^^^ 

T = 1-70, 
Qx = 1-21, 
(skew). 

Interquartile Range = 1-46. 
Semi-interquartile range = 0'73. 
{Qi+ T + Q^ys = 1-86 Paris Lines. 

Rough value for a, 0-73/0-6745 = 1-08 Paris Lines. 
(3) Arithmetical Mean (Spearman's Formula) 



75 


-65 


Oa 


-2' 


65 


-50 


2 


-T 


50 


-40 


T- 


-1-5' 


40 


-25 


1-5 


-<3, 


25 


-14 


<3i 


-1 




Q?.-T 


= 


0-97) 




T -Q: 


= 


0-49 



s 


? 


dp 


centre «' 


dp xs' 


dp X s'' 





■00 


•10 


•25 


•0250 


■0062 


0-5 


•10 


•04 


•75 


•0300 


•0225 


1 


•14 


•26 


1^25 


•3250 


•4063 


1-5 


•40 


•25 


1^75 


•4375 


•7656 


2 


•65 


•15 


2^5 


•3750 


•9375 


3 


■80 


•07 


35 


•2450 


•8575 


4 


•87 


•09 


4^5 


•4050 


18225 


5 


•96 


•04 


55 


•2200 


b2100 


6 


100 


•00 












100 


Sums 


2^0625 


6^0281 



Threshold T = 2-0625 Paris Lines. 

Square of standard deviation = 6-0281 - T^ = 1-7742, 

Standard deviation = 1-33 Paris Lines*. 



* Without Sheppard's correction, for which see p. 84. 
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(4) Constant Process 
using Urban's Tables. (Appendix I^ p. 194) 



s 


working s 


P 


W 


yW 


sW 


sW 


syW 





-6 


-00 


•0000 


-0000 


■0000 


-0000 


-0000 


0-5 


-5 


•10 


■5376 


-•4871 


-2-6878 


13-4388 


2-4356 


1 


-4 


-14 


-6463 


- -4937 


-2-5853 


10-3413 


1-9749 


1-5 


-3 


•40 


•9768 


--1750 


-2-9306 


8-7916 


-5252 


2 


-2 


-65 


-9473 


-2581 


-1-8945 


3-7890 


--5163 


3 





-80 


-7695 


-4579 


-0000 


■0000 


-0000 


4 


2 


■87 


-6215 


-4950 


1-2430 


2-4860 


-9900 


5 


4 


-96 


-3036 


-3759 


1-2146 


4-8582 


1-5036 


6 


6 


1-00 


•0000 


•0000 


-0000 


•0000 


•0000 






Sums 


4^8026 


1-5869 
-1-1558 

-4311 


2-4576 
- 10-0982 

-7-6406 


43-7049 


7-4293 
-•5163 
69130 



= - 1-96 



= 2*02 Paris Lines, 



- 7-64 X 6-91 - -431 x 43-7 
~ 4-80 X 6-91 + -431 x 7-64 
in working units from the working origin 
_ „ 1-96 

, 4-80 X 6-91 + -431 X 7-64 „,_ . ,. 

* = 4-80 X 43-7 - 7-64^ = "^^^ '" ^'''^''^ ^°^*'' 
whence a — l/{\/2h) = 2-93 working units 

= 1-46 Paris Lines. 

Titchener, using the Constant Process with Miiller weights alone 
(the above is with the Miiller-Urban weights), obtained 

T = 1-88 Paris Lines, 

h = 0-49, 

whence a— 1-44 Paris Lines. 

Summarising in one table we have: 

RiecJcer's Data calculated by different Processes 

Process Threshold T Scatter a 

Limiting Process 1^99 0^97 



Linear Interpolation 


1-86 


1-08 


Arithmetical Mean 


2-06 


1-33 


Constant Process 


2-02 


1-46 



T?ie Probable Error of the Thresholds calculated in these different ways. 
The values of the standard deviation given in the table immediately 
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above refer of course to the variation of the individual limina of which 

the threshold T is a central measure. If there are n of these individual 

limina, then in an ordinary way the arithmetical mean of these would 

have a standard deviation of a/\/n. The standard deviations of the 

above values of T are indeed of this order of magnitude, say in the 

first case „„„, ,,, ^, 

0-97/V(100) = -097, 

but to avoid misconception ought to be regarded as distinctly larger 
than this. This arises from various causes which cannot here be gone 
into. In the case of the Limiting Process there is the dependence upon 
the particular choice of stimuli, in the case of Spearman's Arithmetical 
Mean formula there is the uncertainty about the centring of the 
"tails" of the distribution, etc. 

Professor F. M. Urban, some twelve years ago*, brought forward 
reasons which in his opinion showed that the Method of Limits was 
much more exact than the Constant Method: but his mathematics 
contained certain errorsf. The various processes do not differ very 
widely in this respect if each is used in circumstances favourable to 
itself, but on the whole the Constant Process is most reliable, and 
Linear Interpolation least. 

In conclusion, we may venture to express a cautious opinion on the 
choice of a process of calculation from among those given. Frequently 
of course there is no choice, for the conditions of experiment fix the 
matter for us. For example, if the just perceptible points have been 
recorded but not the frequencies p of answers of a certain kind at each 
stimulus- value, then the direct Limiting Process must be employed. 
We will suppose however that the fullest records have been taken. 

If the points at which p = and p= I, that is the points where only 
one kind of answer is given, are known or are very nearly approached, 
the Arithmetical Mean of Spearman is in our opinion best. 

If however, as is frequently the case, these points are unknown, then 
the simple linear interpolation for the 25 %, 50 %, and 75 % points is 
a good plan from the point of view of simplicity and is often of sufficient 
accuracy. 

If the accuracy of the data justifies the use of the full Constant 
Process, then Urban's Tables lighten the work enormously, and give 

* "Die psychophysischen Massmethoden als Grundlagen empirisoher Messungen," 
Archivf. d. ges. Psychol. 1909, xv. and xvi. pp. 261—415. 

t 6. H. Thomson, "Note on the Probable Error of Urban's Formula for the Method 
of Just Perceptible Differences," Brit. Journ. Psyehol. 1913, vi. p. 217 and "The Aoonraoy 
of the Phi-gamma Process," ibid. 1914, vn. p. 44. 
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perfect accuracy if exactly 100 experiments have been made at each 
stimulus. The great advantage of the Constant Process lies in the fact 
that the "tail" difficulty does not arise. Experiments in which it is on 
psychological grounds inadvisable to employ extreme stimuli can only 
be handled by this process. 

But it is not worth while applying it unless the calculator is assured 
that the experimental accuracy justifies it, and unless he has convinced 
himself, by methods to be described in the next chapter, that the 
distribution is not significantly skew, and the data not heterogeneous. 
Very few collections of psychophysical data are worth the accuracy of 
the Constant Process, which is however undoubtedly the best theoreti- 
cally for symmetrical distributions. 

(5) DIFFERENCE THRESHOLDS AND THE PROBABILITY OF A 
JUDGMENT OF A CERTAIN CATEGORY 

A question closely bound up with the mathematics of the psycho- 
physical methods is that of the best measure of a subject's sensitivity 
to differences of stimulus-value. 

To fix ideas, we shall use the case already discussed of the difference 
threshold for lifted weights. When a sufficient number of judgments 
has been collected, the three categories lighter, equal or undecided, and 
heavier, are found to occur with varying frequency with the different 
comparison weights. The difference threshold is then decided by the 
positions of the points T and T' where the descending lighter and 
ascending heavier curves cross the halfway line (see Fig. 7, p. 59). The 
distance (T — T')/2 or some closely similar quantity is what is called the 
difference threshold, and is commonly used in comparing the sensitivity 
of different subjects. The smaller T — T', the more sensitive the subject 
is said to be. 

This distance however depends entirely on the subject's readiness 
to give the answer undecided. It measures therefore rather a moral 
characteristic than a physical sensitivity, and varies very much with 
the instructions given to the subject. The moral character of the 
measure T — T' is above all seen from the fact that any subject who 
wishes may reduce it to zero, whatever may be his actual sensitivity to 
differences of weight, simply by determining that he will never give the 
answer undecided. 

There is however another measure which has been used. This can 
be most conveniently described by considering first a case in which a 
subject gives no undecided answers. In such a case, the thresholds T and T' 
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have come together and on the previous plan the subject's sensitivity 
would be considered as infinite, and all subjects giving no undecided 
answers would have the same infinite sensitivity: whereas clearly the 
subject's sensitivity is connected with the rapidity with which the 
curves pass from to 1 or vice versa, and two subjects may differ very 
much in this respect even although they both give no undecided answers. 
Under these circumstances a measure which has been used is the distance 
Q — Q', the horizontal distance between the crossing of the -25 and -75 
lines (see Fig. 7, p. 59). Under another guise it was used by Fechner also 
for the cases where undecided answers were given. In such cases he 
reduced the three curves to two by sharing the undecided answers 
between heavier and lighter. 

This measure has the advantage that the subject cannot increase 
his apparent sensitivity at will, as was the case with the "threshold" 
measure. Q — Q' is the interquartile range of the point of subjective 
equality, represented by the crossing of the heavier and Ughter curves. 
It and the difference threshold measure distinctly different things, 
and subjects placed in order of merit by the one will be found in a 
different order by the other. 

The points here raised seem to suggest an extension of Urban's idea 
of the probability of a judgment, which compares the giving of the 
judgments, heavier, undecided or lighter, with drawing a ball from an 
urn containing say red, white, and blue balls, and ascertaining its colour. 
For each stimulus the urn is supposed to contain different porportions 
of the coloured balls. 

In place of this is suggested the following. For each stimulus 
imagine an urn containing an infinite number of balls some black and 
some white, in a proportion varying in some way with the stimulus. 
A judgment may then be compared with taking not one but a handful 
of balls from the urn, the hind of judgment depending upon the pro- 
portion of black balls in the handful. 

From this point of view, the standard weight, the variable weight, 
and the physiological make up of the subject decide the proportion of 
black balls in the urn: but the decision as to what proportion is to be 
called heavier, what undecided, and what lighter, depends upon a con- 
scious act of the subject. 



CHAPTER IV 

SKEWNESS AND HETEEOGENEITY IN PSYCHOPHYSICAL DATA 

Obvious skewness of many psychophysical curves — Pearson's test for goodness of 
fit applied to the method of average error — Applied to the method of right and 
wrong oases — Skew curves in homogeneous material — The summation method of 
finding moments — Calculation of a skew curve — Analysis into two normal curves — 
Conclusions. 

(1) OBVIOUS SKEWNESS OF MANY PSYCHOPHYSICAL CURVES 

To anyone accustomed to handling distribution data, a most striking 
point about the results of many psychophysical experiments is the 
obvious skewness of much of the data. Both the examples used ex- 
tensively in the previous chapter show this very strongly as can be seen 
from an inspection of the data either in numerical or diagrammatic 
form, and also from the various values of the threshold T found by the 
different processes of calculation : for these differences arise largely from 
the fact that the distribution is not normal. 

Taking the best of the processes, the Constant Process, it is of 
interest to see how closely the curve which it gives fits the original data. 
A method of thus estimating the goodness of fit of curves has been given 
by Professor Karl Pearson. His method is perfectly general, and 
apphcable to all classes of curves*, but it has been most fully worked 
out for the fitting of bell-curves to histograms. Our problem is not of 
this nature, though it might appear to be so, for the pseudo-histogram 
(Fig. 8) which can be formed from the frequencies f differs essentially 
from a real histogram. Since in psychophysics it may often be necessary 
to fit curves to real histograms, for example those obtained in the 
Method of Average Error, we shall first explain Pearson's Goodness of 
Fit Test for this case, using the bisection data of Chapter II for the 
purpose (see pp. 15 and 42 and Fig. 6). 

(2) PEARSON'S TEST FOR GOODNESS OF FIT APPLIED TO THE 

METHOD OF AVERAGE ERROR 

The bisection data had a mean of 60'13 mms. and a standard deviation 
of 1'38 mms. With these values, using Sheppard's Tables, we draw the 
smooth curve shown in Fig. 6. Now it is important at the outset to 

* Phil. Mag. July 1900, Fifth series, l. pp. 157—175. Phil. Mag. April 1916, Sixth 
series, xxxi. pp. 369 — 378, 
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realise that whether that curve is a good or bad fit to the data depends 
on the number of observations made. The number in this case was 
only 29, and it will presently be shown that the curve is a very good fit. 
But had the number of observations been 2900 it would have been a 
bad fit, for with such a number of observations the histogram ought to 
have modelled itself more closely to the curve. 

In order to apply Pearson's test, we must find the theoretical histo- 
gram for comparison with the observed histogram. That is, we must 
find the areas of the slabs of the curve in Fig. 6 which replace the 
rectangles. (One of these slabs is cross-hatched in that figure to explain 
more clearly what is here meant.) This is most easily done from 
Sheppard's Tables by calculating the areas of the smooth curve from 
— 00 up to each dividing ordinate in turn, and taking the differences 
of these numbers, as is done in the following table. The quantity 
I (1 -|- a) in Sheppard's Tables is the area of a Normal Curve, of unit 
total area, up to the ordinate xja. For negative x'a it has to be subtracted 
from unity. 



Calculation of the Theoretical for comparison with the Observed 
Histogram of the Bisection Data 



X mms. 


x'=x-m-lZ 


Sheppard's 
a;=a;71-38* 


Sheppard's 


Multiplied 
by 29* 


Differences 












009 


63-95 


3-82 


2-77 


•997 


28-91 


0-52 


62-95 


2-82 


2-04 


•979 


28-39 


2-09 


61-95 


1-82 


1-32 


•907 


26-30 


5-36 


60-95 


0-82 


0-59 


•722 


20-94 


7-95 


59-95 


-0-18 


-0-13 


-448 


12-99 


7-33 


58-95 


-1-18 


-0-86 


•195 


5-66 


401 


57-95 


-2-18 


-1-58 


•057 


1-65 


1-36 


56-95 


-3-18 


-2-31 


•010 


0-29 


0-29 




2900 



The actual and theoretical histograms are then compared in the next 
table. Pearson's method, the theory of which cannot be given here, 
then forms the quantity 

2 „ /square of differences of theoretical and observed f requenciesN 
^ ~ V theoretical frequency / ' 

With this value x^> ^^^ '"•' tte number of cells in the histogram. 
Table XII in Pearson's fafefes is then entered and a value of P found. 

* These can be calculated at one opening of Crelle's Tables, or by one setting of a 
slide rule. 
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n' is here 9 counting in the two tail cells. P is then the probability that 
the observed or a worse distribution will be obtained (assuming the 
theoretical distribution) in a sample of the size taken, here 29. 

Calculation of Goodness of Fit of Normal Curve to the Bisection Data 



mms. 


Actual observa- 
tions m 


Theoretical m' 


(m - m'flm' 


Above 63-9 





0-09 


-090 


83— 63-9 


1 


0-52 


-443 


62—62-9 


2 


2-09 


-004 


61—61-9 


6 


5-36 


-076 


60-60-9 


6 


7-95 


-478 


59—59-9 


8 


7-33 


-061 


68-58-9 


5 


4-01 


•244 


57—57-9 


1 


1-36 


-095 


Below 57 





0-29 


-290 




29 


2900 


1-781 =x» 



No. of cells w' = 9. 

From Elderton's Tables, No. XII in Pearson's Tables, 

P = 0-998 for x^=l, 
0-981 for x^ = 2, 
therefore P = 0-99 approximately. 

In our case we find P = 0-99, i.e. 99 samples of this normal distri- 
bution out of 100 would give no better a fit than the present. The 
Normal Curve is therefore a perfectly satisfactory theory for thes8 
29 observations. 

(3) PEARSON'S TEST OP GOODNESS OP PIT APPLIED TO THE 
METHOD OP BIGHT AND WRONG CASES 

The above plan of testing goodness of fit cannot however be apphed 
to the pseudo-histogram of Fig. 8*. The reasons why this is so cannot 
be here gone into in detail, but they are based upon the following 
difEerences between the two cases. In a real histogram, if any one of 
the cells is larger than it ought to be, then any other must have a 
tendency to be smaller than it ought to be. There is a strong negative 
correlation between the numbers in the cells, a correlation, that is, from 
trial to trial. In the psychometric pseudo-histogram, however, formed 
from the proportions p, this is otherwise, because the p's are measured 
quite separately from one another. In a real histogram the numbers in 

* G. H. Thomson, "The Criterion of Goodness of Fit of Psychophysical Curves," 
Biomet/rika, 1919, xn. pp. 216—230. 
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each cell are necessarily positive quantities. In the psychometric 
pseudo-histogram they may be negative, if the p's do not rise steadily. 
Psychometrical data of the kind here considered, in fact, as has 
already been pointed out, are not really in histogram form. Although 
a kind of histogram can be deduced from them, it is only by making 
certain assumptions, and the intercorrelations of the cells of this arti- 
ficial histogram are different from the intercorrelations of a naturally 
observed histogram. Under these circumstances we must in applying 
the goodness of fit test turn to the directly observed quantities p. These 
are compared with the values calculated from the Constant Process in 
the following table, the remaining columns of which are explained below: 



Calculation of Goodness of Fit of a Constant Process Ogive 
Urban'a Subject I, answers heavier 





Observed p 


Calculated p' 


p-p' 


{p -p')VpY 


84 


•0022 


•0013 


•0009 


•001 




88 


•0200 


•0122 


•0078 


•005 




92 


•0889 


•0697 


•0192 


•006 




96 


•2222 


•2394 


-•0172 


•002 




100 


•4133 


•5246 


-•1113 


•050 




104 


•8956 


•7972 


•0984 


•060 




108 


•9400 


•9454 


- ^0054 


•001 






■125 = 8{(p- 


-pT/pV} 



X^ = 450 X -125 = 56-25, 

n' = 8 (one more than the number of stimuli), 

P = -0000005 from Pearson's Tables. 

To these quantities p the principles underlying Pearson's test can 
be applied direct. They are indeed the same principles already used in 
Chapter II when we were discussing curve fitting. We have n quantities 
p which are independently measured, and n quantities p' which are 
theoretically given. The variations of p from p' are binomial in form, 
that is approximately normal. If we look upon the judgment heavier, 
as suggested in an earher paragraph, as being comparable with drawing 
black balls out of a bag containing black balls and white balls in the 
proportions p' and 1 — p', then the probable error of p is 

-6745V{p'(l -/)/*}, 

k being the number of judgments of which pk are of the category heavier. 
For the chances of obtaining 0, I, 2, ... , k— I, oi k black balls in 
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a drawing of k are given by the terms of the binomial (p' + q'f, q' being 
1 — p' ; that is, the chances of obtaining 

f = Ojk, Ijk, ... {k— l)/k, k/k or unity. 
The standard deviation of the above binomial is ■s/i^P'Q')^ and the 

standard deviation of p therefore t-\/(^p'2')^ or Vip'i'/^)- The proba- 
biUty of an error p — p' ia therefore 

yk -2^'^p-p')' 
V(^p'q') ' 

The probability of the whole set of observed values p^, p^, ... p„ 
occurring is the product of n such factors, and is of the form 

where ^^2 = ^ jjfc (^JZ^Tl 

or if jfc is the same at each stimulus, 

The remaining columns in the above table calculate this quantity*, 
which is the same as Pearson's x^, under our special circumstances. 
For n' we have to use n+ 1, because all the n values of p can vary 
separately, whereas in a real histogram the number of variables is one 
less than the number of cells. We thus reach the value P = -0000005, 
that is, the curve is an incredibly bad fit to the data, which cannot 
possibly be regarded as differing from a normal distribution by sampUng 
errors alone. 

(4) SKEW CURVES IN HOMOGENEOUS MATERIAL 

There are two immediate hypotheses which present themselves to 
explain this bad fit of the normal curve, (a) that the material is not 
homogeneous, the conditions of experiment not having remained the 
same throughout, (6) that the material is homogeneous, but the under- 
lying factors which cause the distribution of errors or deviations are 
not independent, but correlated, as in the system of generaUsed proba- 
biUty curves next to be described. A word of caution may first be given, 
for, as we have already suggested, the identification, or even the 
approximation, of graphical representations of "percentage" judgments 
to true frequency-distributions is of extremely doubtful validity. 

* A table in Appendix I, giving values of l/ipq), Hghtens the work conaiderably. 
B. &T. 6 
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The general theory of curve fitting has been worked out in great 
detail by Professor Karl Pearson*. 'A good account of his method is 
given in a book by W. Palin Elderton, Frequency Curves and Correlation, 
C. and E. Layton, London, 1906, to which the mathematical reader is 
referred for fairly complete theoretical and practical information. 

Frequency curves of data not involving a mixture of species tend 
to commence at zero, rise to a maximum, and then fall either at the 
same or at a different rate. There is often high contact at one or both 
ends of the distribution. An equation of the general form 

dy ^ {x + a)y 
dx f (x) 

satisfies both these conditions, since ii y = 0, dy/dx — (high contact), 
and if x— — a, dy/dx = (maximum ; for a maximum, again, the 
second differential coefficient must be negative). Expanding / (a;) by 
Maclaurin's Theorem, we have 

^_ (a: + a) y 

dx Cq + c^x + CgX^ + c^a^ + ..." 

(1) Putting Ci = C2 = Cj = ... = 0, we have 

1 dy x + a 

ydx" Co ' 
which is the Gaussian or normal curvef. It fits the symmetrical binomial 
(I + 2)"; 6.g. in coin tossing, where the chances for and against are 
equal (p = q), and the contributory causes are independent of one 
another. 

(2) Putting C2 = Cg = ... = 0, we have 

1 dy X + a 

y dx Cq + CjX ' 
which represents a class of curves varying from the Gaussian curve to 
the J-curve. It fits the asymmetrical or "point" binomial {p + q)", e.g. 
in teetotum spinning or dice throwing, where the chances for and 
against are not equal, p=f=q, but the contributory causes are still 
independent of one another. 

* Karl Pearson, "Skew Variation in Homogeneous Material," Phil. Trans. 1895, 
oIjXXXVI. a, pp. 343 ff. ; "On the Systematic Pitting of Curves to Observations and Measure- 
ments," Biometrika, i. pp. 265 ff. and n. pp. 1 ff., 1901 — 3; "On the Curves which are 
most suitable for describing the frequency of Random Samples of a Population," 
Biometrika, 1906, v. pp. 172 — 5 (an exceedingly clear summary of the principles involved). 
Also later papers in Phil Trans. 1901, oxovn. A, pp. 443—459, and 1916, being supplements 
to the first mentioned memoir. See also pp. Ix to Ixx in Pearson's Tables for Statisticians 
and Biometricians, Cambridge, 1914. 

t Compare equation (5), Chap. II, p. 34. 
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(3) Putting Cg = C4 = ... = 0, we have 

Idy _ X + a 
y dx 



Co + CyX + C^X'' 

which can be made to represent almost all the frequency distributions 
which may arise. It fits the hypergeometrical series, the successive 
terms of which, e.g. give the chances of getting ft, A — 1, ... black balls 
from a bag containing fn black and qn white balls when k balls are 
drawn*. 

Here the contributory causes are not independent of one another. 

There is no advantage in employing equations which involve Cg and 
higher constants, because their use necessitates the calculation of the 
6th and higher "moments," and these have very high probable errors. 

Definition. The nth moment coefficient {fi„') of any distribution about 
any ordinate is the sum of the products of the partial frequencies and 
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the nth. power of the distances of these frequencies from the ordinate, 
divided by the total frequency. In symbols, if N be the total frequency. 



NiJ,„' -= j afySx. 



The moments are, in practice, first calculated about any arbitrary 
ordinate that is most convenient, and then reduced to moments about 
the centroid in the following way (dashed /n's represent moments about 
an arbitrary ordinate, undashed /a's about the central ordinate) : 

n(n — 1) 



NiJL„ = j (x- xYyhx = N{in - nxNfj.'„_i + 
This gives the general reduction formula 



1.2 



■i^N,,'„. 



f^n 



* The series is 

pn(pn-l) ... {pn - fe + 1) j. 
n(n-l) ...(n-k + l) \ 
Other series may arise. 



kqn 



n{n — 1) 
1.2, 

fc(fc-l) 



/*! f^ «-2 ■" 

qn {qn - 1) 



pn-k + l 2! ' {pn-k + l){prk-k+2) 



...}. 



6—2 
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which enables us to transfer any moment from an arbitrary ordinate 
to the mean. Thus we have: 

Ma = A*2' - Mi'^ 

Mi = M4' - ^Mi'ms' + ^(^I'W ~ W*- 
The symbols [i represent moments of the curve; but we have to 
start with grouped frequencies, where the frequencies are assumed to 
be concentrated along the mid-ordinates of the rectangles (cf. Fig. 105). 
The moments obtained from these grouped frequencies are denoted by 
v'b (dashed and undashed), and corrections are necessary. These have 
been deduced by Sheppard* and are consequently known as Sheppard's 
adjustments. They are: 

M2 = "2 - iV. 

It is generally said that they are only valid when there is "high 
contact" at the ends of the frequencies, but the equations for jug and /ig 
are probably still vahd even without high contact, it the terminal 
frequencies are zero. 

(N.B. In working, ^''s are changed into v's before applying Sheppard's 
corrections.) 

It is obvious that I'o = 1 and v^ = 0. v^ is the distance of the mean 
or centroid vertical from the arbitrary ordinate about which the moments 
are first taken, and is conveniently known as d. 

Two very important constants in curve fitting are 

^, = ^;,and^, = i^. 
Ma Ma 

The values of these are always to be calculated, and, within the hmits 

of their probable errors, they fix the type to which the curve belongs. 

The general frequency curve equation, written in terms of moments, is 

^.^V^i (^a + 3) 

ldy_ ^ 2 5j3a-6^i-9 

ydx- A 4^2-3^1 V^, (^, + 3) X 2^, - 3;3, - 6 /xy) • 
(10^2-12j3i-18'*' 2 5|82-6j8i-9'a"^10;82-12^i-18VCT// ) 

* W. F. Sheppard, "On the Calculation of the most Probable Values of Frequency 
Constants, for Data arranged according to Equidistant Divisions of a Scale," Proc. Lond. 
Math. Soc. XXTX. pp. 363 fi. Karl Pearson, "On an Elementary Proof of Sheppard's 
Formulae for correcting Raw Moments and on other aUied Points," Biometrika, 1904, m. 
pp. 308 £E. 
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This gives at once, for the distance between the mean and the mode, 

2(5;82-6;8i-9) 

(origin is at mean). 

Hence the curve is symmetrical if ^Sj = 0. If yS^ = and ^^ = ^> ^^^ 
curve reduces to the Gaussian or Normal Curve, since the terms in- 
volving X in the denominator of the right-hand side of the general 
frequency curve equation then vanish. 

In using the Pearsonian method, then, the order of procedure to be 
adopted is : 

(1) Calculate the moment coefficients v^', v^, v^', v^ about a con- 
venient arbitrary ordinate, 

(2) Transfer to the mean by the equations 

"2 = "a' - "i'^ 

I/3=.'3'-3viV2'-l-2V3, 

(vi or d is the distance of the mean from the arbitrary ordinate.) 

(3) Determine the corresponding moments for the curve by the 
equations 

^2 = i-z - iV I 

/tj = cj > Sheppard's corrections. 

/*4 = "4 - ka + rfrri 
(N.B. For these corrections to be applicable, two conditions must be 
fulfilled: 

(i) there must be high contact, 

(Li) the grouping of the frequencies must be eqtial.) 

(4) Calculate /Sj and ^^ ^7 t^® equations 

o ^ f^3 o _ M4 
Pi- JTa' Pi~ 772- 

These results give the distance of the mean from the arbitrary ordinate 
(i/^' or d) ; the standard deviation (V/^a) ! ^^<i ^^^ mode 

<yVPiW2 + ^) 
°'^^''-2(5)8,-6;8i-9)- 

The median is more difficult to determine exactly, but a position 
which is approximate and indeed very accurate in all the curves we are 
likely to meet with in this work, is between the mean and the mode, but 
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nearer to the mean, so that the distance from the mode is twice the 
distance from the mean*. 

Mean Median Mode 



1/3 2/3 

The investigator may then proceed to determine to which of the 
Pearsonian "types" the particular curve belongs, to find its equation, 
and to plot it. The type is decided by the constants jSj and jSg (using 
Diagram XXXV, p. 66, Pearson's Tables) or by the criterion k^ where 

)8, ip, + 3)^ 
' 4 (4;S3-3)8i) (2^3^-3^31 -6)' 

using the following diagram: 



[ II if /3j < 3, 
= 3, 
>3. 



Type J G „ =3, 

Ivii 



Type IV 



Typel 




TypeV 



Type VI 



The equations to the types are given in detail in Pearson's Tables, 
p. Ixiii, or in Elderton (op. cit.), where valuable advice on the arrange- 
ment of the calculations will be found. Elderton's Type II includes 

* Karl Pearson, Biometrika, i. 1902 — 3, p. 265; Phil. Trans, clxxxvi. A, p. 375; Boy. 
Soc. Proc. LXVin. p. 369b. C. V. L. Charlier, "Researches into the Theory of Probability," 
Lunds Univeraitets Arsahrift, 1905 (1), 1, equation 9. Arthur T. Doodson, "Relation of the 
Mode, Median, and Mean in Frequency Curves," Biometrika, 1917, xi. p. 426. 
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both our Types II and VII, and his Type VII is our G, the Normal 
Curve. 

We shall confine ourselves to a fuller description of Type IV, which 
appears to be the type most common in psychometric skewness*. 

It will be clear from the above account that the whole of the calcula- 
tions are based upon the first four moments of the data, and we proceed 
first to describe the most convenient way of finding these when as here 
the stimuli are equidistant, viz. the summation method already used 
lor finding the mean on p. 19. If the stimuli are not equidistant the 
calculations are rather longer. 

(5) THE SUMMATION METHOD OF FINDING MOMENTS 
IN THE CASE OF DATA AT EQUIDISTANT POINTS 

A full description of this device will be found in Mr Palin Elderton's 
book already cited, where it is attributed to Mr G. F. Hardy. It has 
however been independently used by numerous writers, e.g. Lipps, 
Wirth, Urban, etc.f 

The theory cannot be worked out here, but the reader can easily do 
so for himself. It is only a question of simple algebra; or from another 
point of view it is the same thing as integration by parts. We give only 
a worked example, Urban's Subject I, heavier answers. 



Example of the Use of the Summation Method 



Grams 


P 


series 


of successive sums 


84 


•0022 


•0022 


•0022 


•0022 


88 


•0200 


•0222 


•0244 


•0266 


92 


•0889 


•nil 


•1355 


■1621 


96 


•2222 


•3333 


•4688 


•6309 


100 


•4133 


•7466 


12154 


18463 


104 


•8956 


16422 


2^8576 


47039 


108 


•9400 


2^5822 


5^4398 


101437 




2^5822 


5^4398 


101437 


17^5157 




8^ or d 


«8 


s. 


s. 



* Thirteen out of fifteen psychometric curves tried recently were Type IV. Cf. Pearson 
on zoological and anthropological curves, where Type IV also prevails, Phil. Trans. 1896, 
OLxxxvi. A, Part I, pp. 388, 403 and 411. 

t G. F. Lipps, "Die Theorie der Collectivgegenstande," Wundt's Phil. Stud. Bd. xvn. 
Separat-Abdruck, Leipzig, 1902; W. Wirth, "Die mathematischen Grundlagen der soge- 
nannten unmittelbaren Behandlung psychophysischer Resultate," Wundt's Psychol. 
Studien, 1910, Bd. vi. pp. 141, 252, 430. Urban on Wirth, Archiv f. d. gee. Psychol, xi. 
Literaturbericht, p. 1. 
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The table is self-explanatory. Each column is formed from the 
preceding one by successive summations (from the top in this case), 
and is then totalled*. The origin is here the centre of the group beyond 
108 grams, viz. 110 grams, and the unit of measurement is 4 grams, 
measured downwards from 110 grams. We have 

Mean = 110 - 4(Z = 99-6712 grams. 

Further, it can be shown that the moments 

v^ = 2S^-d{l + d), 

vs = 6/S4 - 3v2 (l + d)-d(l + d)(2 + d), 

v^ = 2iS^-2vs{2{l + d) + l} 

- v^ {6 {l + d)(2 + d)-l}-d(l + d) (2 + d){3 + d). 

The work is not heavy up to this point if arranged systematically, 
and in the present case it gives 

v^ = 1-6296, 
v^ = 0-9645, 
v^ = 9-1621, 

or, using Sheppard's corrections, 

jLtg = 1-5463, a = V/^a = 1-2435, 

JLI3 = 0-9645, 
JLI4 = 8-3765. 

This a gives in original units 4 x 1-2435 = 4-974 grams, differing from 
the value mentioned on p. 63 because of the Sheppard adjustment. 
From the moments we obtain 

)8i = 0-251, 
J82 = 3-51, 
k^ = 0-775. 

The type is therefore Type IV; within the Umits of probable error 
of )8i, ^2j ^^^ "2 i* might however be Type G, VII, or V. Unfortunately 
the ordinary methods of finding these probable errors are of doubtful 
significance in the case of pseudo-histogram data such as ours. We turn 
to the calculation of Type IV. 

• Slightly greater accuracy could, by the way, be attained by using the actual numbers 
of answers heavier and not the proportions p in cases Uke the present where an awkward 
number of experiments was performed at each stimulus, viz. 450, leading to recurring 
decimals. The totals would then be divided by this number. 
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(6) CALCULATION OF A SKEW CUEVE (Type IV) 
The equation* is 



a 



wherein m = |(»- + 2) say, 

where r = %{p^-fi^- \)l(2^^ - Z^^ - 6), 



V{16(r-l)-^,(r 

Sk (skewness) = l/{4:m). 
Origin, at mean + va/r; 
mode, at mean — skewness x a; 



m 



rl/n say. 



Vo 



N y^r e 






m-i" 



tan j> ■■ 



a ^(2^-) (cos ^y+^ 
The actual form of the calculation depends on the appliances avail- 
able, whether Crelle's Tables, or logarithms, or calculating machines. 
Using Crelle and seven figure logarithms we get finally, after much 
labour, the following values for the ogive, which has to be obtained 
from the bell-curve by simple quadrature: 

Urban's Sttbject I, Heavier answers 



Stimulus 


Observed p 


Calculated by 
Type IV 


(p-py/ipV) 


84 

88 

92 

96 

100 

104 

108 


•0022 
•0200 
•0889 
•2222 
•4133 
•8956 
•9400 


•0046 
•0184 
•0690 
•2155 
•5006 
•8078 
•9624 


•0012 
•0001 
•0062 
•0002 
•0305 
■0496 
•0139 






Sum •lOl? 



X^ = 450 X -1017 = 45-8, P < -0000005 still. 

We find therefore that fitting the best Pearson curve possible to the 

data makes practically no improvement in the fit, which is still so very 

bad as to make it quite certain that the data are not homogeneous at 

all. The fit cannot be much improved by other assumptions as to the 

spread of the "tail," several of which have been tried. The Type IV 

* The notation on this page is that of Elderton following Pearson, and the symbols 
m, V, n and r have no connection with these symbols used elsewheie in the present book. 
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curve itself is shown, contrasted with the pseudo-histogram, in Fig. 12. 
The mean (99-67), median (99-97) and mode (100-52) are worth com- 
pariDg, as a matter of iaterest, with the thresholds obtained otherwise 
(see pp. 53, 60, 63 and 70). 

The reason for the bad fit in this case is, mathematically, the im- 
possibility of finding a curve to accommodate both the tall rectangle 
217, and the tail of 27, however the latter may be allocated. Much of 
Urban's other data shows the same bad fit, and for the same reason, the 
size of the "tails." Not all however are bad. The best case is Subject II 



Mean = 99-67 grs- 
Mode 100-52 „ 
Median = 99-97 „ 




84 88 92 96 100 104 108 

JKg. 12. A Type IV curve fitted to TJrban's data for Subject I, Jbeaviar 
answers, in pseudo-histogram form 

(Urban himself) who had had much 'practice at this form of experiment. 
The lighter answers in his case were as follows, compared (1) with a 
curve fitted by the Constant Process and (2) with a Pearson skew curve 
(here Type I). 

Urban's Subject II, Lighter answers 



Grams 


Observed p 


Normal Curve p 


Type I Curve p 


84 


•9333 


•9604 


•9432 


88 


•8622 


•8640 


•8520 


92 


•7000 


•6767 


•6875 


96 


•4489 


•4456 


•4627 


100 


•2311 


•2320 


•2379 


104 


•0956 


•0922 


•0858 


108 


•0156 


•0272 


•0187 



On testing the goodness of fit we obtain 

For Normal Curve, P = 0-48. 
For Type I Curve, P = 0-91. 
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Here the Gaussian is a good, and Type I an excellent fit. There is every 
reason then to think that the data here are homogeneous. The bell- 
curve and pseudo-histogram are shown in Eig. 13. 

Since we have decided that the data of Urban's Subject I are hetero- 
geneous, that is, that the conditions of the experiment varied consider- 
ably during its performance (which lasted several months), the question 
naturally arises as to whether we can analyse the data mathematically 
into two or more frequency-distributions. This question was discussed by 
Professor Pearson in 1894* as far as an analysis into two normal curves 
goes. One more moment, /xj, is needed and as the probable error of this is 
considerable the practical application of the plan to be described is not 
very satisfactory. 

1-200 



-100 



Tail =30. 





/ 


113 


— .^ 


K. 






98 






/ 

73 




/ 


s 

61 


s. 




^ 


32 



Jail =7 



.80 84 88 92 96 100 104 108 

Kg. 13. A Type I curve. Urban's Subject II, lighter answers 

(7) ANALYSIS INTO TWO NORMAL CURVES 
Stage I. Eind the centroid of the frequency curve and calculate 

/*2» /*3> /*4. /^5. -^4 and -^5- A4 = 9fi2^ - 8/^4, 

A5 = 30/*2f^ - 3/*5. 
Stage 11. Solve the following nonic equation for pj using Sturm's 
functions to localise roots: 

24^2" - 28A4P2' + 36/i32p,6 _ (24^03^5 - IOA42) 'p^ 

- (I48/X32A4 + 2A52) P2* + (288itx3* - 12AA/^ - A4«) f^ 
+ (24/^2A5 - liL^-K^) fi + 32,^*A4P2 - 24,.36 = 0, 
and find p^ from 



ViPi = 



* Phil. Trans. Soy. Soc. London, 1894. 
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Stage III. Find y^ and y^ the roots of 

y2 - ViY + Ti = ^- 
hy^ and hy^ are the positions of the axes of the normal component curves, 
where h is the unit of length. 

Stage IV. The fractions z^ and Zg, that the areas of the component 
curves are of the area of the whole curve, form the roots of the quadratic 

z^-z~ „P\ = 0. 

Stage V. The standard deviations are found from 

CiW = F2 - in^lVz - sPiVi + Pi> 




80 84 88 92 96 100 104 108 112 grams 

Kg. 14. FBeudo-histogtam analysed into two normal curves, with their sum. 
Urban'e Subject I, heavier answers (fit still very poor) 

The above refers to asymmetrical frequency curves. For symmetrical 
curves (where the components have the same axis) modifications are 
required. 

On carrying out this very troublesome calculation with the hetero- 
geneous data of Urban's Subject I, heavier answers, the nonic was found 
to have three possibly usable roots: but two of these led to imaginary 
quantities, leaving only a root which led to the following analysis, 
approximately: 



First component 


Second component 


Centre 98 grams 


101 -7 grams 


<T 5-73 „ 


3-54 „ 


Area 250 


200 
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The two curvea are drawn on the adjoining figure. On adding them 
and finding their areas between the appropriate points (84, 88 grams, 
etc.) we arrive at the following comparison : 





Urban's Subj 


ect I, Heavier answers 


Grams 


p observed 


p' of Ckunpound 


ip-p'r/ip'g') 


84 


•0022 


•0039 


•001 


88 


•0200 


•0228 


•000 


92 


•0889 


•0836 


•000 


96 


•2222 


•2262 


•000 


100 


•4133 


•4938 


•026 


104 


•8956 


•8031 


•054 


108 


•9400 


•9603 


•016 




•097=Bum 



X^=450x^097=43^65. 

The decrease in the value of x^ obtained by dissecting into two 
normal curves is therefore only very slight indeed. P is still only about 
•000001 in value. The heterogeneity is more complex than can be thus 
dealt with. 

(8) CONCLUSIONS 

Further analysis is useless. For since there are only seven points 
given by experiment in the curve, and also the total area, it is clear 
that an eooact fit could be obtained by two skew curves, or a skew and 
a normal, which have between them eight constants; or in many ways 
by three normal curves. 

All we can say is that this subject's sensitivity oscillated between 
at least two states; and if only two, then one of these is such as to give 
a skew curve. Without attempting to get an exact fit, it is clear after 
our experience, that provided we supply a flat normal curve to give 
the awkward tails, the surplus of the distribution could be fitted success- 
fully by a skew curve. 

This suggests that the two components into which the heterogeneous 
data are thus divided are (a) a component due to erratic answers giving 
a wide shallow distribution and (6) a component really corresponding 
to the conditions of the experiment. 

The use of "catch" tests in threshold determination is to check 
component (a). The proper way to employ them is to cancel sittings, 
at which numerous catch errors occur, the subject being then pre- 
sumably in an abnormal state. 

In Eiecker's experiment many "catch errors" were present, i.e. 
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occasions when the subject answered two to a one-point stimulus. Fitting 
the best normal curve to his data, we only get, on testing its goodness 
of fit, P = -0003. 

Riecker's Data 
Mean 2^02, ff = l ■46 Paris Lines 



Paris 
Lines a 


Observed p 


«-2^02 


«-2^02 
146 


Gaussian p' 


p-p' 


{p-py/p'q' 





•00 


-2-02 


-1-38 


■084 


-■084 


■092 


0-5 


•10 


-1^52 


-1^04 


■149 


-•049 


•019 


1 


■14 


-b02 


-0-70 


•242 


-■102 


•057 


1^5 


■40 


-©•52 


-036 


■359 


■041 


•007 


2 


•65 


-0^02 


-©■014 


•494 


■156 


•097 


3 


•80 


0^98 


0^67 


■749 


•051 


•014 


4 


■87 


1-98 


b36 


•913 


-■043 


■023 


5 


■96 


2-98 


2-04 


■979 


-■019 


■018 


6 


100 


3^98 


2^73 


■997 


■003 


•003 




Sum •SSO 



X' = 100x^330, Ji' = 10, P = ■00031. 

Compare with this the following case. The data are for the spatial 
threshold on the forearm*, and were gathered by the Method of Non- 
Consecutive Groups. Sittings containing more than a certain number of 
catch errors were rejected, the number chosen being one which when 
exceeded at all was usually exceeded violently. Other experimental 
precautions were taken which are described in the articles cited. As a 
result it is found that a roughly fitted Normal Curve is quite a fair fit, 
for in 13 out of 100 cases a worse departure would be got by chance: 
and a skew curve would improve this. The data are in fact reasonably 
homogeneous. (Calculations on opposite page.) 

The conclusion of the whole matter is that we are led to believe that 
the difficulties of psychophysical experiment are such that homogeneity 
in the data is rare. For such data refinements of mathematical calcula- 
tion are out of place. The curve fitting methods here described are 
however of value in discovering the heterogeneityf. 

With increasing precautions in carrying out the experiments and 
with increasing practice on the part of the subject, it would appear that 
the data finally reach a distribution where they are fairly well fitted by 
a Normal Curve, and excellently fitted by Pearsonian Skew Curves. 

* G. H. Thomson "A Comparison of Psychophysical Methods," Brit. Joum. Psychol. 
1912, V. pp. 203 — 241; "Changes in the Spatial Threshold at a Sitting," Brit. Joum. 
Psychol. 1914, vi. pp. 432 — 448 and B.A. Report, 1913, pp. 681—683. 

t Compare Urban's use of the Lexian Coefficient of Dispersion (op. oit.) and compare 
the latter with Pearson's more signifloant criteria /Sj, p^, and the /Sj/Sj diagram (XXXV 
in his Tables). 
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Normal Integral fitted, to Thomson's Spatial Threshold Data 


Cms. 


Two-point 
answers 


Continued 
sum 


Cms. from 
mean 


In «• 
units 


Theoretical 
p' from 
Sheppard 


Observed p 


{p-p'flp'q' 





5-2 


5-2 


-2-44 


-2-29 


■Oil 


•035 


•05293 


i 


6 


11-2 


-1-94 


-1-82 


•034 


•040 


•00109 


I 


8 


19-2 


-1-44 


-1-35 


-089 


•053 


•01599 


IJ 


21 


40-2 


-0-94 


-0-88 


•189 


•140 


•01565 


2 


56 


96-2 


-0-44 


-0-41 


•341 


•374 


•00485 


2i 


Si 


180-2 


0-06 


0-06 


•524 


•560 


•00521 


3 


105 


285-2 


0-56 


0-52 


■698 


•700 


•00002 


3J 


125 


410-2 


1-06 


0-99 


•839 


•834 


•00018 


4 


141 


551-2 


1-56 


1-46 


•928 


•940 


-00216 


4i 


144 


695-2 


2-06 


1-93 


•973 


•960 


-00644 


5 


148 


843-2 


2-56 


2-40 


•992 


•987 


•00315 




f 5 


843-2 


3137-2 








Sum =• 10767 


150 
























30 


168-64 


627-44 












5-621 


20-915 






d 


Sz 











d in cms. is 2^81 from an origin of 5^ downwards. 

Mean=5-25-2-81= 2-44 cms. 

25, =41-83 

d(l+d) = 37-20 

»2= 4-63 

Sheppard's correction 0-08 

;H;= 4-55 =0-' 

(T =2-133 
or 1-067 cms. 
X2 = 150x -10767 = 16-15, 
re' = 12, P=0-13. 



PAET II 
COERELATION 

CHAPTER V 

INTRODUCTION TO CORRELATION 

A SOMEWHAT detailed account of the mathematical theory of correlation 
and of the way in which it may be usefully applied to psychological 
measurements will be found in the later chapters of this Part. The object 
of the following introductory pages is to give the reader a general pre- 
liminary view of the method, free from mathematical complications, 
and to illustrate it by means of a simple example. 

Correlation may be briefly defined as " tendency towards concomitant 
variation," and a so-called correlation coefficient (or, again, correlation 
ratio) is simply a measure of such tendency, more or less adequate 
according to the circumstances of the case. J. S. Mill, in his "System 
of Logic," distinguished a special scientific "Method of Concomitant 
Variations," which he based upon the following principle: 

"Whatever phenomenon varies in any manner whenever another 
phenomenon varies in some particular manner, is either a cause or an 
effect of that phenomenon, or is connected with it through some fact 
of causation*." 

The instances of this principle which Mill had in mind were mainly 
cases of approximately "complete" concomitance of variation, such as 
those usually met with in the domain of Physics. In such cases, the 
conditions of an experiment admit of a high degree of simplification, 
the phenomenon, or series of phenomena, under investigation can be 
isolated with tolerably complete success, and the " irrelevant " factors can 
be reduced to a minimum. Under such conditions, when the degree of 
concomitance of the different corresponding measures of the two pheno- 
mena is found to be very high, the slight deviations from complete 

* Logic, Bk. m. Ch. viii. § 6. 
B. & T. 7 
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correspondence are put down to "errors of observation" or other un- 
avoidable imperfections in the experimental method employed. 

If the correspondence is one of simple proportionality, so that the 
graphical representation of it (one phenomenon being measured along 
the axis of x, the other along the axis of y) is a straight line*, the corre- 
lation coefficient r will be unity. Example: the variation of the length 
of a metal rod with temperature. 

If the correspondence, although still approximately complete, is not 
one of simple proportionality, the graphical representation of it will be, 
not a straight line, but a curve of greater or less complexity f, and the 
correlation, also complete, will be measured not by the correlation 



\ 

:v. 



•V. 

• •V. 






•••■*/rf-^; 



Kg. 15 

coefficient r, but by the correlation ratio tj, which in this case will be 
unity. Example: the variation of the volume of a certain quantity of 
gas with the pressure to which it is subjected, the temperature remaining 
constant. A number of pairs of values P^, V^; Pg, F2; P3, F3, etc. is 
obtained, and when plotted they are found to give a "scatter diagram" 
of the approximate form of Fig. 15. 

In this figure, the dots represent the individual pairs of observations 
P, V. They cluster very closely about the hyperbola, PV = k, repre- 
sented by the broken curve. The curve is assumed to represent the 
"real" or "true" relation of the two "variates" (as we call such 
quantities as P, V), and the slight deviations of the observed values 

* Hence the correlation is said to be "linear.'' 
f Correlation aaid to be "non-linear" or "skew." 
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from this curve are explained as due to errors of observation and to 
other factors irrelevant to the relation under invesbigation. However 
this may be, the interesting point about the figure so far as our present 
purpose of explaining correlation is concerned is that any definite 
observed P-value is "correlated" with a plurality or "array" of observed 
F- values, and that, similarly, any definite observed F-value is correlated 
with a plurality of P-values. These arrays of observed values cluster 
extremely closely about their means (situated on the curve), i.e. their 
"scatter" or "variability," as measured by their standard deviations (a), 
is extremely small. 

The modern theory of correlation is directed towards the manipula- 
tion of observations made upon phenomena of a much greater degree 
of variability than that found in the case of isolated physical phenomena. 
The increased variability is no doubt due, in the main, to the complexity 
of factors involved. The elementary factors do not admit of isolation, 
and with reference to the concomitance of variation of the two series 
of phenomena under consideration they, as it were, pull in different 
directions. The correlation coefficient and correlation ratio measure, in 
these cases, the average extent of the concomitance. As will be explained 
more fully in the next chapter, r can only be taken as a measure of corre- 
lation when the average relation between the two variates is linear, and 
in this case its value is identical with that of ij. When the relation is 
non-linear r is practically meaningless, but r/ still measures the relation 
accurately. 

The general problem will become clearer by reference to the ac- 
companying figure. 

Let us assume that we have a group of 200 school-children and have 
measured each of them for mechanical memory (x) and for general 
intelligence (y). Each of the dots in the figure represents a child. Then 
if we determine the mean y-values corresponding to each successive 
"group" of a;- values, e.g. ajg to ojg, by assuming the observations con- 
centrated on the mid-ordinate PM*, the line AB'f drawn through these 
mean y- values (marked by crosses in heavy type) represents the law of 
change of mean y-value with increase of a; and gives the " most probable " 

* The true oentroid ordinate is slightly nearer the denser part of the scatter diagram, 
here slightly towards the right of PM. The correction is made later by means of Sheppard's 
formulae (see above, p. 84). 

•f The means have been placed on the straight line for the sake of convenience of 
exposition. Actually, they will occur irregularly on either side of it, and AB will be the 
"best fitting" straight line, determined by an application of the Method of Least Squares. 
See next chapter. 

7—2 
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value of y for any particular value of x. If the line is straight or ap- 
proximately straight the "regression" is said to be linear, and the 
equation to the line is 

y-y = r^ (x- x), 

where x, y are the mean values of all the x's and y'a respectively (not 
the means of the arrays just mentioned), ctj, (t^ are the standard 
deviations of the x's and «/'s respectively, and r is the coefficient of 
correlation. 




meeh. memory 



Kg. 16 



Thus if 



20 is the mean value of the mechanical memory of the group, 
30 „ „ „ „ general intelligence of the group, 

4 „ „ standard deviation for mechanical memory (ct^), 
7 „ „ „ „ „ general intelligence (a^), 

and, finally, -6 is the value of r; then the most probable measure of the 
general intelligence of a child whose mechanical meinory is represented 
by, say, the value 14, is given by the equation 

2/ - 30 = -6 X f (14 - 20), 
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whence y = 23'7. This value is the average of an array of possible 
values, whose standard deviation 

= 5-6. 

It will be proved in the next chapter that 

where x and y are deviations from the mean (not absolute values as 
assumed above), and S ( ) indicates summation, i.e. 

S (xy) = x^y^ + x^^ + x^y^-\r ... + Xj^y^^, 

where N is the total number of cases (children measured). 

It is important to note that by starting from y instead of from x, 
alid determining the means of the ^-arrays (such as the array within 
the limits y^^^ J/a^/s'); another regression line, CD, is obtained different 
from the first. Its equation is 

x-x = r^{y-y), 

and it represents the law of change of mean aj-value with increase of y. 
It gives the "most probable" value of x for any particular value of y. 

If the series of means do not lie on a straight line (approx.) but on 
a curve of greater or less complexity, the above calculation is meaning- 
less. In such a case, called a case of skew correlation and non-liaear 
regression, the only measure of the correlation of the two variates is that 
given by tj, the correlation ratio, t) is the ratio of the standard deviation 
of the means of the arrays (S) to the total standard deviation (of either 
the a;'s or the ^'s). Thus there are two values of ij, one for the x's, and 
another for the y's. They approximate closely to one another, as a rule, 
so that only one need be calculated. 

r,=^, or -?. 

When the regression is linear, tj = r; otherwise tj > r. r ranges 
between the values ± 1, tj between and 1. tj is always positive. 

It will now have become clear that the correlation ratio, r/ (always) 

* First suggested by Bravais; shown to be the best measure by Professor Karl 
Pearson, who gave it the name of the "product-moment" formula. A. Bravais, "Analyse 
mathematique sur les probabilites des erreurs de situation d'un point," Acad, des Sciences, 
Mimoires prisenUis par divers savants, IP Serie, ix. 1846, p. 255. Karl Pearson, F.R.S., 
"Regression, Heredity and Panmixia," Phil. Trans. Roy. Soc, 1896, CLXXXVii. A, 
pp. 253 ff. 
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and the correlation coefficient, r (when regression is linear) are measures 
of the tendency towards concomitant variation exhibited by two series 
of phenomena, and hence throw some light upon the causal relations 
of these phenomena. Exactly what kind of causal relation we are justified 
in inferring from them will become clearer in the course of the next few 
chapters. 

We may illustrate the significance of the idea of correlation in a 
slightly different (and more elementary) way. Let us suppose that the 
200 children have been arranged in order of merit, as regards mechanical 
memory, on the one hand, and as regards general intelligence on the 
other. If now it were found that each child's order was the same in 
both, i.e. that the child first in mechanical memory was first in general 
intelligence, the child second in mechanical memory was second in general 
intelligence, and so on, correspondence between the two series would be 
complete and r would equal +1. Or if, on a second supposition, the duld 
first in the one was last in the other, the child second in the one was 
next to last in the other, and so on, the correspondence between the two 
series would again be complete, but inverse, and r would be — 1. Finally, 
if there is no correspondence whatever between the two series, r will 
be zero. A value of r between and + 1 will express a tendency, greater 
or less according to r's size, for children above the average or mean 
position in the one ability to be above the mean position in the other, 
and for children below the mean position in the one to be below the 
mean position in the other. A value of r between and — 1 will express 
a tendency, greater or less according as r is numerically greater or less, 
for the children above the mean position in the one ability to be below 
the mean position in the other, and conversely. Now if order or " rank " 
be taken as an inverse measure of ability, the value of 

ler-^ or r 
becomes ^ _ &S {d^) 



N (N^ - 1) ' 

where d is the difference between the rank of an individual in the one 
series and his rank in the other. This form gives us a general impression 
of its appropriateness for the purpose in view, since the greater the 
disparity between the two series of ranks the greater is S (d^) and hence 
the smaller is r. If there is no relation at all between the two series, 
S (d'^) acquires the value it would have according to pure chance, and 
this can be shown to be N (N^ — l)/6, which makes the whole expression 
zero, as it should do. 
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The one objection to the formula is that it assumes the difference 
between any two neighbouring ranks to be equal at all parts of the 
scale. This is obviously a false assumption; the distance of individual 
from individual at the two extreme ends of the scale must be con- 
siderably greater than that between individuals near the middle. A 
correction for this, based on the assumption that the form of distribution 
of the abilities in each of the cases is Normal, has been calculated by 
Professor Pearson. It is 



2 sin 



where p = 1 — 



N (N^ - 1) ' 



At the end of this chapter is given a table whereby /j-values may at 
once be converted into corresponding r-values, according to the above 
equation. 

Finally, there is the question of the "probable error" (p.e.). Like 
every other constant calculated from a limited sample of variable 
material, the coefficient of correlation varies in value from sample to 
sample, and a measure is needed of the limits within which it may be 
expected with a fair degree of probability to lie. This measure is given 
by the probable error. In the case of r determined by the product- 
moment formula, when N is sufficiently large, 

•67449 ,, „, 

which means that it is an even chance that the true value of r lies 
between the limits 

•67449 (1 - r2) 



r ± 



VN 



The chances are 16 to 1 against the value falling outside the limits 
r ± 3 P.E. 

For r determined by .the rank formula, the probable error is slightly 
larger, being -7063 (1 - r^)/VN. 

If N, the number of cases, be small (say, less than 30), the probable 
error is larger. Its exact size under such conditions is not known. 

The following is an example of the way in which a correlation 
coefficient may be obtained by means of ranks. The subjects were boys 
in the Fourth Form of a Public School, and the correlation to be obtained 
is that between ability in Classics and ability in Drawing. 
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Form Order 










Clasaios 


Drawing 


d" 


R. C. 0. 


1 


9 




(1-9)^ 


= 64 


H. G. M. 


2 


2 









B. L. 


9 


16 






49 


F. L. S. 


7 


6 






1 


C. M. S. 


3 


15 






144 


C. J. L. H. 


5 


4 






1 


A. L. P. 


6 


17 






121 


E. G. T. 


4 


3 






1 


F. C. F. 


8 


5 






9 


N. P R. N. 


U 


14 






9 


H. B. D. 


10 


12 






4 


S. H. T. 


14 


7 






49 


H. B. M. 


12 


1 






121 


L. H. S. 


13 


8 






25 


J. P. C. 


15 


10 






25 


E,W. 


16 


18 






4 


C. C. M. 


17 


11 






36 


L. H. W. 


18 


13 






25 


E. M. J. 


19 


19 









N = 19 










688=S(d8) 


p=l- 


6S (d^) 


Tj='- 


6 
19 


X 688 
X 360 


= •40, 




:. r=2 




= 


•416, 






•7063 (1 - r2 

P.E. = ^= 


)^ 


= -134. 





' ' Vn 

r is here just over three times its probable error, and we might 
therefore feel inclined to conclude that it proves a real correlation 
between the two series. We must remember, however, that 19 is a very 
small number of cases, and that therefore the real probable error is 
considerably larger than that given by the formula. Hence the reality 
of the correlation is not so certain. Our caution is proved to be justified 
when we turn to the next higher form, the Remove, and find that, with 
the same number of boys, the correlation between ability for Classics and 
Drawing ability works out as — -SIS (± •14), quite a different result. It 
might be objected that other factors than mere smallness in the number 
of cases were responsible for the difference; e.g. that the tendency to 
specialise in Classics was greater in the Remove than in the Fourth, and 
that the consequent neglect of Drawing by the abler boys lowered the 
correlation. To this it may be replied, firstly, that the drawing-master 
was the same for both forms, and was likely to get as much out of the 
boys as possible in each case, and, secondly, that the difference between 
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the two forms in respect of the degree of specialising tendency was iii- 
sufficient to account for the disparity of the results. 

The correct way to compare the results mathematically is to deter- 
mine the probable error of their difference. This = the square root of 
the sum of the squares of the probable errors of each*, i.e. 

which, in this case, = V-13^ + -14^ 

= ■19. 

The difference = -ilQ + '313 = -73, nearly four times the size of its 
probable error = -19. 

A very important extension of the theory of correlation is the con- 
ception of "partial" correlation. If, e.g., three mental abilities are 
correlated with one another, it is of interest to know how closely any 
two of them are correlated with one another for a constant value of the 
third. Such a coeflScient is written, in Yule's notation, r^^.s- 

This may be illustrated from our example by taking the form order 
for English into consideration in addition to that for Classics and that 
for Drawing. The correlation between Classics and English works out 
as -78, that between Drawing and English, as -21. 

Then the correlation between Classics and Drawing for "English 
constant" is 



I^CD.E = 

VKt- — rcET) VK'- — i^DBT) 

.zt9 _ .7» V .01 

•42. 



•42 - -78 X -21 



^(1 - -782) V(l - -212) 

Thus in this particular case, the "partial" coefficient is practically 
identical with the " entire " coefficient. 

If therefore boys were selected, out of a population of which 
the actual form is a random sample, so as to be all equal in their 
"English" ability, the correlation between their "Drawing" ability and 
their "Classics" ability would be unaffected. Of course such a set of 
boys, in addition to being all alike in English, would be less scattered 
in both Classics and Drawing (especially in the former) than are the 
boys of the actual form, and their average ability in these subjects 
would be higher or lower than that of the actual form according to the 
level of ability in English at which they had been selected. 

On the other hand the partial correlation of English and Drawing 

* See p. 24. 
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for "constant Classics" will be found to be — -2, so that selection for 
Classics creates a negative correlation between English and Drawing, in 
so far as we can judge from this particular case. 

The reader must be warned against the temptation to draw deduc- 
tions as to the "common factors" uniting any of these pairs of subjects. 
The fallacy underlpng such reasoning is discussed in pp. 139 — 145. 

The principle of partial correlation can be extended to include an 
indefinite number of variables, and general formulae for this purpose 
wUl be given in Chapter VII. 

It is obvious that when the subjects in the group examined are not 
all alike in respect of some irrelevant factor such as age, these same 
formulae can be employed to ascertain what the correlations would be 
in a group which was homogeneous with regard to the factor in question. 
Great care has to be taken in interpreting the results of such calculations 
however, as fundamental assumptions may not be satisfied. This wiU 
become clearer in the following chapters. 

Table for converting p into r(r = 2sin^/3j*. 



p 


r 


P 


r 


P 


r 


P 


r 


■05 


•052 


•30 


•313 


•55 


•568 


•80 


•813 


■10 


■105 


•35 


•364 


•60 


•618 


•85 


•861 


■15 


■157 


•40 


•416 


•65 


•668 


•90 


•908 


■20 


•209 


•45 


•467 


•70 


•717 


•95 


•954 



•25 ^261 ^50 -518 ^75 •765 l-m bOOO 

* Quoted from K. Pearson, P.R.S., Drapers' Company Sesearch Memoirs, Biometric 
Series, iv. 1907, p. 18. 
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THE MATHEMATICAL THEORY OF CORRELATION 

Correlation coefficient r — Correlation ratio ij — Probable errors — ^The normal correla- 
tion surface and its properties — Other methods of determining correlation — Fourfold 
table — ^Method of contingency — ^Two-row table — Short methods — The method of 
ranks — Spearman's foot-rule — Correlation of sums or differences. 

In the present chapter an attempt will be made to summarise briefly 
the principal methods in use for obtaining a measure of the correlation, 
or tendency towards concomitant variation, of two or more variates. 

Let the coordinates of the dots in the accompanying diagram — 
commonly known as a "scatter diagram" — ^represent the measures of 
two separate characteristics, e.g. speed of adding figures (x) and accuracy 
of adding figures (y), in a number of individuals (N). 



I 






:A\-Siit 



^^S^^^c' 



C -v.-. 



O speed of adding (x) 

Fig. 17 

Let the crosses represent the mean values of y corresponding to 
values of x lying between the limits of pairs of successive units of measure- 
ment. Then the broken curve CC passing through these crosses repre- 
sents the most probable law of relationship between speed of adding 
and accuracy of adding, and is known as the regression curve. (In 
practice the crosses do not lie so accurately on the curve.) 

(1) CORRELATION COEITICIENT (r) 

What we chiefly want to know, however, even when the regression, 

as here, is not linear, is (1) whether large x is on the whole associated 

with large y, etc., (2) how to find roughly the mean y associated with 

given X. To do this, we find the "best fitting" straight line, LL', to the 
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swarm of dots in the figure, using, merely from motives of convenience, 
the Method of Least Squares*. 
Let the equation of LL' be 

Y = b^X + c. 
Then applying Least Squares will make S (y — Y)^ a minimum, where 
y is the ordinate of any dot, and Y the ordinate of the line at the same 
abscissa, which is both X and x. There will be as many equations 

(y-Y) = y- {b^jx + c) = v 

as there are dots, and the d's correspond to the "residuals" of the 
Method of Least Squares, though here they are real deviations and not 
errors. The Normal Equations for b^x and c, formed according to the 
rule on p. 45, give at once 

S (xy) - 6ai S (x^) - cS (x) = 0, 
Siy) -b,,S(x) -cS{l) = 0. 
If there are N points, S (1) = N; and if the as's and y's are measured 
from the mean of the whole, S{x) = S (y) = 0. The equations then become 

S(xy) = b,,S(x% 
Nc = 0. 
Thus c is zero, that is the line LL' goes through the mean. And b^, 
which is the tangent of the slope of LL', is 

S(xy)_S(xy) 
^^ S (x^) Ngj^ ■ 
The line LL' is known as the regression line, and &2i as the coefficient 
of regression of y on x. If we define r as 

. S{xy) 





Nojor^ ' 


then 

and the equation to LL' is 






Y = r-'X, 



"^1 
X and y being measured from their mean values. 
An analogous equation 



X = r^Y 



gives the regression of x on y. There are thus two regression lines. If 

* G. Udny Yule, "On the Significance of Bravais' Formulae for Skew Correlation," 
Proc. Boy. Sob. 1896, LX. pp. 477 — 489. 



OH. VI] MATHEMATICAL THEORY OF CORRELATION 109 

X and y, in addition to being measured from their means, are also 
measured in terms of their standard deviations as unity, the regression 
equations become 

X = rY and Y = rX, 

and r is then itself the coefficient of regression oi y on x and of x on y, 
the two regressions being equal. 

Since y — Y measures the distance in the y direction of any point 
from the regression line, the quantity 

S(y-Y)^ 
N 
gives the mean square deviation, in the y direction, of all the points 
from the regression line, 

S(y- \ixY S (/) S{xy) S (x^) 

Hence the standard error or deviation made in estimating, by means 
of the regression equation, the value of y most probably associated with 
any particular x, is, on the average, given by 

and if the distribution is Normal it has this value not only on the average 
but for each array. 

r is known as the coefficient of correlation, and evidently must lie 
between the values + 1 and — 1. If the regression line coincides with 
the regression curve, within the limits of errors of random sampling, — 
in other words, if the regression is linear — r is a measure of the degree 
of dependence between x and y. When r = ± 1, the points close up upon 
the line and the "scatter diagram" contracts to become the line itself. 

S (xy) 
The formula r = ^ 

is implied in Bravais' work of 1846, and was shown by Professor Karl 
Pearson in 1896 to be the best measure of r. Hence it is known as the 
Bravais-Pearson Product-Moment Formula. It may be written 

S(xy) 
'''VS{oo')VS{y^)' 
the denominator being the geometrical mean of the two second moments, 
and the numerator the product-mommt, of x and y. 
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If X and y are not measured from their means, but from some con- 
venient point distant rf^ from the x mean and d^ from the y mean, the 
arithmetic is very considerably lightened, and the formula becomes, as 
may be tested by simple algebra, 

_ S jxy) — Ndjdz 

' - V{S (x') - Nd^^} V{S iy') - Nd^} ■ 

The following example is intended to show one form of calculation 
based on this formula. Being only a model for this purpose, it is kept 
short so that the arithmetic can be easily followed. But it must he made 
quite clear that really to calculate correlations with only ten cases is absurd, 
for the probable errors are enormous and moreover are unknown (see 
p. 114). A calculation with larger numbers, and made by a slightly 
different process, is given on p. 115 et seq. 

In the following table 4 and -B are the percentage errors made by 
certain cadets in a test in judging distance, in the years 1915 and 1916 
respectively. 



Name 


A 


B 


x=(A-\&) 


y=(B-n) 


x^ 


y' 


xy 


Cadet A 


15 


15 


-I 


2 


1 


4 


- 2 


„ B 


19 


10 


3 


-3 


9 


9 


- 9 




, c 


11 


14 


-5 


1 


25 


1 


- 5 




. D 


17 


16 


1 


3 


1 


9 


3 




. E 


8 


18 


-8 


5 


64 


25 


-40 




, I' 


14 


12 


-2 


-1 


4 


1 


2 




. G 


24 


14 


8 


1 


64 


1 


8 




, H 


28 


U 


12 


-2 


144 


4 


-24 




, K 


7 


8 


-9 


-5 


81 


25 


45 




, L 


14 


14 


-2 


1 


4 


1 


- 2 


Means 15-7 


13-2 




Sums 


397 


80 


-24 


Provisional 16 


13 




Subtract 


0-9 


0-4 


- 0-6 (i.e. NM) 


d,= -l 


! (^2=2 






3961 


79-6 


-23-4 















23-4 



-v/(396- 1x79-6) 



= -•13. 



(2) CORRELATION RATIO (r,)* 

It is clear that if the regression is not linear r ceases to be a satis- 
factory measure of the relation between the two characters under con- 
sideration. In an extreme case, such as that shown in the accompanying 
diagram, r may be zero while there is yet a very close relation between 
the two characters. 

Clearly, if the individual observations, i.e. the dots in the figure, are 

* See Drapers' Company Sesearch Memoirs, Biometrio Series, n. p. 9 et seq. Karl 
Pearson, "On the Theory of Skew-Correlation and Non-Linear Regression." 
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all exactly situated on the regression curve, the quantity y is an exact 
mathenaatical function of x, and correlation is perfect, or ij = 1, where 
17 is a new and as yet undefined measure of correlation, called the 
correlation ratio; while if the individual observations are much scattered 
right and left of anyone walking along the regression ciirve, the correla- 
tion is imperfect, and ij < 1. 

If there is no scatter at all in any array, then the correlation is 
perfect, and the greater the scatter the less the correlation, i.e. the less 
certain is any prediction of y from x. 

Prof essor Pearson therefore makes the correlation ratio ij depend on the 
amount of scatter in the arrays. Exactly, he makes it depend on the mean 
of the weighted squares of the standard deviations of the arrays, i.e. upon 

o-^2 = /SKa„/)/iVr. 



■t^: 



^ 









Pig. 18 

The correlation ratio rises and falls as this quantity falls and rises. m„ 
is the number of cases in the array of which a„^ is the standard deviation. 
Clearly the unit in which CToj,^ is measured for this purpose must 
depend on the standard deviation of the whole of the y's. If the arrays 
are less scattered than the whole, there is correlation. If there is no 
correlation, any array will have just as much scatter as the whole has. 

Pearson writes 

a 2 

-^ = 1 - rj2- 

<^2 

Compare this equation with that found above for the case of linear 
regression, p. 109, namely 

"average" mean square deviation of an array = a^^ (1 — r^), 
which in our present notation is 

^ = 1 - j-2 

0^2 
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and from the comparison we see that Pearson has chosen rf so that it 
becomes r^ for linear regression. The correlation ratio -q however never 
becomes negative but is in linear regression numerically equal to r. 

The above formulae already allow of the calculation of r^, but a 
simplification is possible. By its definition, 






■■ S' {y^ - y^YIn^, 



S' being a summation up and down an array, so that 

a^^ = SS'{y,-y,r/N, 
S being a summation at right angles to the former, i.e. a summation 
of the arrays. That is to say, the mean of the weighted squares of the 
standard deviations of the arrays is the same thing as the mean square of 
the distances of the dots (measured in the y direction) from the regression 
curve. This simplifies to 

Na^^ = SS' (y/) - 2SS' (y,y,) + SS' (%^) 
= Na,^ - 2S {y,S' (y,)} + S (n.yj^) 
= Na,^ - 2S (y^n,%) + S (n,y^) 
= Nai - S (n,y,^) 
= Nai - mi, 

where Sg is the standard deviation of the means of the arrays, each array 
being weighted with the number of cases in it. Therefore 



a„J = o-c^ 



■^2 > 



-"Oj, — "2 ^2 

n*. 

,,2 



'- 'I „2~ „2 ~ ^ _2' 



"2 '^'i 

S2 

<^2 

The correlation ratio, therefore, is the ratio of two standard deviations, 
one of the means of the arrays (properly weighted), the other of the whole. 
Starting from the other variate, we arrive in a similar way at a 
second value „ 

T? = — . 

Since ij is the ratio of two standard deviations, it must always be positive. 
Let Y be the ordinate of any point on the regression line, then the 
average of the sum of the weighted squares of the distances between 
the regression line and the regression curve 

.. S{n,(y,-Yf} 
N 
which reduces to cr^ (if — r^). 
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Thus r] must always be numerically greater than r, except in the 
case of linear regression, when it is numerically equal to r. 

In examining the relationship between two measurable characters, 
7] should be calculated as well as r, since it serves as a test of the linearity 
or non-hnearity of the regression, and is also a better measure of causal 
relation than r. 

A simple criterion for Unearity which is very generally apphcable is 
that ,_ 

Vn 



•67449 



.iVr]^-r^<2-5*. 



For very exact work, more complicated formulae need to be em- 
ployed. 

The results obtained above are all independent of the forms of dis- 
tribution of the variates. 

(3) PROBABLE ERRORS 

In determining means, standard deviations, and other frequency 
constants, the investigator is unable to work from the " total population " 
and must be content with the results obtained from "random samples" 
of greater or less size taken from this (in some cases, hypothetical) total 
population. 

When the number of cases (n) in the random sample is fairly large — 
so large that fractions containing certain higher powers of n in the 
denominator can be neglected — the probable errors are found to be as 
f oUowsf : 

p.E. of a mean = -67449 — ;=, 

Vn 

„ „ --67449^^, 

1 _ y2 

„ „ r =-67449 - 



Vn 

The second and third of these values are only correct when the 
frequency-distribution is normal or approximately normal. In parti- 
cular, for large values of r the true p.e. may be considerably different 
from that given by the above formula unless the distribution is normal. 

1 — rj^ 
p.E. of rj = -67449 — 7=i- , for linear regression, and also, as a rough 

Vn 

* J. Blakeman, Biometrika, rr. pp. 349, 350. 

t See W. Gibson, "Tables for FaoiKtating the Computation of Probable Errors," 
Biometrika, iv. p. 385 et seq. and Pearson's Tables. 

B. & T. 8 
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measure, for cases of skew correlation. If greater exactitude is needed 
in the latter cases, more complicated formulae have to be employed*. 
Another frequency constant in common use is the coefficient of 

variation V, which = . 

mean 

Its p.E. = -674497 |l + 2 (j^)V/v'2^t- 

As stated above, the values just given for the probable errors only 
apply in cases where n is fairly large. In cases where n is so small that 
certain higher powers of its reciprocal cannot be neglected in comparison 
with the rest of the expressions involving them, the values cannot be 
used. For such cases no theoretical formulae have hitherto been devised. 

An empirical investigation has however been madej on samples of 
4, 8, and 30 cases, taken from a "total population" of 3000 pairs of 
measurements (height and left middle finger measurements of 3000 
criminals; "real" correlation, •&&). From the results obtained it may 
be concluded that, although in the case of such small samples as 4 or 8 
the ordinary formula for the probable error of r gives much too low a 
value, yet in the case of as many as 30, the formula applies with tolerable 
accuracy. We must, however, bear in mind that this result has only 
been proved (empirically) to hold in the single case when the actual 
correlation was -66. 

The calculation of the probable errors of means, standard devia- 
tions, coefficients of variation, and coefficients of correlation is very 
much facihtated by the use of Pearson's Tables for Statisticians, es- 
pecially Tables V, VI, VII and VIII, calculated by members of the 
staff § of the Biometric Laboratory, University College, London. 

We give next an example of the evaluation of r and of rj between 
speed of adding single digits and accuracy in doing so, the individuals 
measured being 86 boys between the ages of 11 and 12 years from two 
L.C.C. elementary schools. The two groups could be thrown together 
for this purpose, since the means and standard deviations calculated 
from them separately were in very close agreement — well within the 
limits of the probable errors. 

* Karl Pearson, op. cit. (Biometric Series, ii.) p. 19. 

t Calculated values for different values of n given in Gibson's Tables, see pp. xxii 
and 18 of Pearson's Tables. 

t "Student" : "The Probable Error of a Coefficient of Correlation," Biometrika,lSOS — 9, 
VI. p. 302. 

§ Miss Winifred Gibson, X)r Raymond Pearl, T. Blakeman, Dr David Heron, Miss 
H. Gertrude Jones, H. E. Soper. 



CH. VI] MATHEMATICAL THEOKY OF CORRELATION 115 






>o 



"Sb 

I— ( 



QQ 


o 






C3 








>. 


^ 




«i 


(N 


2 




^0 






^e 


1 


M-( 




b>( 


1— t 

f— t 


o 

s 
o 






'rt 






<1) 








60 






•^ 


CIj 


Ti 




^ 


m 


'^ 






t 






O 


CO 


+3 






00 


§ 

a 

OQ 








fl 
Ol 

^ 


4 






+3 








aj 


fl 






nQ 


O 








pi 






OS 


T3 






« 








M 








M 








o 








O 







I 



I 



»3 



1 



"^ 
-i>i 



<X1 









6>i 






4 



o 

CO 

o 






to 

6"3 



+ 



0=0 



5^ 






U5 



00 

II 



I— I -^ 
OO 

II II 



C4 



+ 



o 
+ 



o 

o 
o 



I 



§i3 



S' 



(^) uoi!jippv JO jfoBinooy 



-1 

°0 


1 




«rt 


•s 


"3 




C> 


1 


1 


2 


£ 






'^ 


»> 


« 








^ 


oa 


+J 


T) 




C3 


o 


d 



^ 


00 






-t> 








^ 


s 


•j^ 


0) 


^ 


J^ 


S 


ID 




^ 






(D 


n 




C) 




3 


.a 


^ 



S s 



> a 

a ■§ 
II 

§1 



■■a a 

m ° 

S -a 

60 * 
<D O 

ja <^ 

Eh o 

ja 

. H 



8—2 



116 



CORRELATION 



[PT. II 



Frequency 


x' 


Frequency 
xa;' 


Frequency 
xa;'" 


Frequency 


y' 


Frequency 
y.y' 


Frequency 


3-5 
18-5 
14-5 


-3 

-2 

-1 



1 
2 
3 
4 


10-5 

37 

14-5 


31-5 

74 

14-5 

15-5 
42 
22-5 
48 


6 

4 

3 

9 

18 

25 

21 


-5-5 
-3 

-2 

-1 

1 
2 


33 

12 

6 

9 


181-5 
36 
12 


18 


-62 


9 


15-5 


15-5 
21 
7-5 
12 


-60 


— 


10-5 
2-5 


25 

42 


25 

84 


3 


86 =N 


+ 67 


347-5 


S6=N 


+56 


248 


+ 7 






-6 





d^= 



86 
248 



= - 0-06977, 



1 



"1 - 86 

= 2-7955, 
C7i = 1-67. 



-W^-i^*> 



12 



^2 = 86 = 0-0814, 



2 347-5 

= 4-0341, 
ag = 2-013. 



-dit 



Frequencies 


x'y'X 


Total 
frequency/ 


f-xx'y' 


1+5-5 -4-5 -1-5 


1 


0-5 


+ 0-5 


2-5+0-5 + 2-5 + 3-4-5-5-2 


2 


-3 


-6 


1+0-5 + 0-5-1 


3 


1 


3 


2 +2-5 + 0-5 -2-6 


4 


2-5 


10 


0-5-1 


5-5 


-0-5 


-2-75 


1+2-1 


6 


2 


12 


0-5 


8 


0-5 


4 


1 


'9 


1 


9 


3 


11 


3 


33 


-1 


22 


-1 


-22 




71-5-30-75 




>S(a!y) =40-76 



* Sheppard'a correction. Bemember that a^=ii.2, and d=vi'i see above, p. 84. 
t Sheppard's correction cannot be used here, since the units of the subgroups are 
not equal and there is not high contact at the ends of the frequencies. 
X The figures in italics in the correlation table. 
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r = 



S (xy) = S (x'y') - Nd^d^ 

= 40-75 + 86 X -0057 
= 41-24, 
Sjxy) 41-24 



Najcr^ 



p.E. = -67449 



86 X 1-67 X 2-013 
l-r2 



0-143, 



= 0-071. 



J-speea of addition = 0-14 ± O'OT. 

ace. of addition 



Vx -y 


(yx-yy^^'>^x 


-0-224 


•050176 X 3-5= -175616 


-1-081 


1-168561x18-5 =21-618379 


0-488 


■238144x14-5= 3-453088 


0-322 


-103684x18 = 1-866312 


0145 


-021025x15-5= -325888 


0-490 


■2401 xl0-5= 2-52105 


1-719 


2-954961 x 2^5= 7-387403 


- 1-414 


1-999396 X 3 = 5-998188 




^Kfe-S?)}" =43-345924 



43-345924 



- 86 X 4-034 = «-12*«' 
.-. •>j = 0-353, 
p.E. = -67449 (1 - 'n^yvN 

= 0-064. 

-■. ■>?speed of addition = 0'35 ± 0'06. 
aco. of addition 

Calculating r/ from the means of the y-arrays, we have 

S {riy (Xy — Xf} 



yf = 



19-68101 



86 X 2-796 
= -0819. 
.-. 17 = 0-29 ± 0*07. 

To test the value of iq obtained from the means of the a;-arrays, for 
linear regression : 

VN , . ,~, ^^ -323 



•67449 



lVrf-r^*=- 



2 X -07273 
= 2-22, i.e. < 2-5. 

Hence regression may be considered to be linear. 

* See p. 113. 
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Regression coefficients: 



fe„ = r-i=-118, 



Equations to regression lines are 



•172. 



and 



I.e. 






y), 

■X). 



equation to regression line AB is 

y - 164-2 = -172 (x - 237-7), 
y = -172a; + 123-316 

237-7 (mean speed) 



.(i). 



TIU 

125 

i 
1 

t 

^ 155 
o 






/ «= 
































A 


A^ 


\, 












/ 






V — - 




\ 








/ 


"^170 

i 


me 


n aaeanq/ 


\ 




^ 




~^ii 










\ — 


1 


\1 




X 


w 


B 












ID 






V 






^A 


1€ 


iO ■ 2 


6 


26 


8C 


3^ 


K> 3 


10 



speed (no. of figures added in 10 minutes) 
Kg. 19 

Similarly the equation to the Une CD is 

X - 237-7 = -118 (y - 164-2), 
i.e. a; = -118?/- 218-324 (ii). 

Equation (i) gives the most probable value of y associated with a 
given value of x, with a standard error 

(T2V(l-r\ i.e. 1-991. 

Similarly, mutatis mutandis, with equation (ii). 

As a model of the proper use of the correlation coefficient and ratio 
we may cite a very detailed investigation into the relationship of in- 
telhgence to the size and shape of the head, and to other physical and 
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mental characters, by Prof. Karl Pearson*, which appeared in Bio- 
metrika, v. 1906 — 1907. The subjects measured were 1000 Cambridge 
undergraduates and considerably more than 5000 school-children. Special 
care was taken in drawing up a quantitative scale of intelhgence, ad- 
justments being made so that the results fitted a "normal" or Gaussian 
distribution. The correlations were worked out in several different ways, 
— correlation coeflicient (r), correlation ratio (rj), coefficient of mean 
square contingencyf, and the method, first suggested in this paper, of 
the analograph. Pearson describes this last method as follows : " In the 
case of intelligence, I take a normal scale as my base line and plot up 
the percentage of the character for each grade of intelhgence along the 
centroid vertical of the corresponding range, drawing a horizontal line 
to represent the mean percentage in the population at large. We thus 
obtain a diagram, which I will venture to call an analograph. 

" If the percentage increases or decreases continually with intelhgence 
(or with the base character, whatever it may be), I term the relationship 
homoclinal; if the percentage does not reach its maximum with the 
maximum or minimum of intelhgence, I term the diagram heterocUnal." 

(4) THE NOBMAL CORRELATION SURFACE AND ITS PROPERTIES 

We have so far considered the correlation coefficient r from the point 
of view from which it was approached by Galton, who measured it by 
the inchnations 6-^ and 6^ of what we have called the regression lines, 
according to the formulae 

r = — tan ^i = — tan $2 = ^(^^11 ^i ^^^ ^2)- 

We followed this up by an application of the Method of Least Squares, 
first made by Mr Udny Yule, obtaining the more advantageous formula 

Naj(T2 ' 
known as the Bravais-Pearson Product-Moment Formula. All we have 
so far done is independent of the form of distribution of the correlated 
variates. 

From a historical point of view however this is not quite the way in 
which the subject developed. In 1846 A. Bravais pubhshed, in Vol. ix 
of the MSmoires de Vlnstitut de France, an article entitled "Analyse 
mathematique sur les probabiUtes des erreurs de situation d'un 

* Karl Pearson: "On the Relationship of Intelligence to Size and Shape of Head, and 
to other Physical and Mental Characters," Biometrika, 1906 — 1907, v. pp. 105 — 146. 
■f See below. 
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point," which was really a very complete study of normal correlation, 
although this is not perhaps obvious to anyone casually glancing through 
the volume. (He uses the word "correlation" itself, however, on p. 263.) 

Galton's work was done in ignorance of this article, and of course 
was work apphed directly to certain social and anthropological problems, 
whereas Bravais' is a piece of mathematics only. Also in ignorance of 
Bravais' work, Professor F. Y. Edgeworth* took some important steps 
on the road later followed by Professor Karl Pearson, who connected 
Galton's work with Bravais', adopted the product-moment formula 
which was impUcit in the latter's equations, and showed that it is the 
best formula for the purpose, i.e. it has the least probable error. 

Mr Udny Yule still later pointed out the simple method of arriving 
at this formula which we have used in an article of the Proceedings of 
the Royal Society for 1896 — 7, the chief importance of which is that it 
points out the fact, then first adequately reahsed, that the product- 
moment formula has a definite significance even if the distribution of 
errors is not normal. 

In the present section we shall now consider more definitely normal 
correlation. 

If the X variate is normally distributed according to the law 

and the y variate according to the law 

vm^/^' <^'- 

thentheprobabihtyof simultaneous occurrence of a value x and a value y is 

P = „— ^e 2V<ri» ^^'Jdxdy (3), 

provided x and y are independent or uncorrelated. If however they are 
correlated, this probabiUty is 

1 / x' 2rxy y^ \ 

which reduces to the former expression when r, the coeflGicient of 
correlation, becomes zero. The surface 

z = P'/dxdy (5) 

* Phil. Mag. 1892 and 1893, several articles. In a paragraph burled in one of these, 
indeed, Professor Edgeworth reached, but did not realise the importance of, the product- 
moment formula, which he there gives as the best formula {Phil. Mag. July 1893, Series 5, 
XXXVI. on p. 100). 
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is the normal correlation surface, and is a hillock shaped Hke a bell 
with an oval mouth. 

When two variates are recorded on a grid-iron table Hke that used 
in the above example of Speed of Addition and Accuracy of Addition, 
the resulting table is called a "correlation table*." As, owing to the 
experimental difficulties of the subject, psychological correlation tables 
are seldom smooth enough to illustrate vividly the points about to be 
mentioned, we quote here a correlation table from the study of heredity 
which has already been used elsewhere for a similar purposef . 

Father's Stature 





Inches 


59 


60 


61 


62 


63 


64 


65 


66 


67 


68 


69 


70 


71 


72 


73 


74 


75 




60 








2 


2 


4 
























6] 










2 








4 




















62 




1 


1 




2 


4 


1 


1 


2 


2 


















63 


. 


1 


1 


9 


9 


8 


16 


20 


11 


5 


. 


1 


1 












64 


4 


. 


6 


15 


12 


17 


32 


37 


12 


6 


6 


3 


5 












65 


8 


4 


2 


8 


13 


38 


54 


43 


30 


22 


14 


10 














66 


. 


2 


4 


9 


21 


38 


40 


67 


70 


64 


21 


8 


10 


4 










67, 




6 


8 


19 


14 


55 


79 


106 


103 


78 


50 


55 


13 


2 


4 






s 


68 






6 


8 


30 


M 


41 


97 


126 


94 


118 


53 


34 


38 


9 








69 






4 




21 


20 


51 


73 


64 


96 


116 


86 


40 


14 


9 




4 




70 










4 


10 


23 


75 


47 


78 


90 


78 


58 


25 


14 


6 


4 




71 












13 


20 


35 


43 


76 


59 


83 


43 


32 


20 


4 


4 




72 












1 


12 


5 


28 


31 


43 


45 


40 


34 


11 


2 






73 














3 


3 


10 


30 


26 


24 


30 


25 


13 


2 


2 




74 










4 




6 


6 




21 


9 


10 


26 


13 


13 




8 




75 




















4 


8 




10 


3 


7 


2 






76 




















5 


1 




2 


4 


4 








77 




















5 


1 


4 






6 








78 






















4 


4 




1 


3 








79 




























1 


1 







For simplicity in printing, the numbers in the original table in Biometriha, 1902 — 3, 
n. p. 416, have been multiplied by four: this eliminates quarters and halves which occur 
through some heights being halt-way between whole inches. 

* It is very unfortunate that Mr J. C. Maxwell Gamett has recently used this term 
(already fixed in the meaning given in the text) for something quite different. Proc. Soy. 
Soc. 1920, xovi. A, p. 100. 

t E.g. Mr Udny Yule's text-book on the Theory of Statistics; and, independently, 
by G. H. Thomson, "Mathematics and the Inductive Methods of Logic," Proc. Univ. 
Durham Phil. Soc. 1912—13, v. pp. 76—99. 
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It is interesting to look at a correlation table in greater detail. If 
we think of it as a plane horizontal surface, and erect over the centre 
of each compartment a vertical Une proportional to the number written 
in that compartment, then the tops of these Unes touch the correlation 
surface. 

It is clear from the equation to the surface that the contours, or 
lines of equal z, are elUpses, given by the equation 



x^ 2rxy f 



+ 



-fl"2 



. 2 



= constant 



.(6). 



In the figure the numbers 40 and over have been printed in italics so 
that this contour fine can be approximately followed. It is seen to be 
roughly elhptical, the major axis of the elhpse lying obliquely. The 
major axes of all the contour ellipses of a surface showing correlation 
are inchned to the axes of coordinates; and if, as we always can, we 
choose the linear units of x and y on the diagram in such a ratio that 





Kg. 20 



Kg. 21 



<^i — <^2' tben this incHnation will be at 45°, as shown in the accompanying 
figure (Fig. 20). 

In the case of no correlation, »■ = 0, and if o-j = ctj, the equation (6) 
given above for the contour lines represents a circle. In this case, 
therefore, with suitable units of x and y, the contours are circles, and the 
correlation surface is a perfectly symmetrical hillock. As correlation 
increases, the contour lines become more and more drawn out along 
the 45° fine (or the 135° fine for negative correlation), as suggested 
in the figure (Fig. 21). 

In a contoured plan of a correlation surface, those lines through the 
origin are important which cross all contour lines at points where the 
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tangents to the contour lines are parallel to the axes of coordinates. 
Such are L^L^ and £2-^2 ^^ ^ig- 20. If we differentiate equation (6) 
with regard to x and equate to zero we shall obtain LJL^. We find 

2x 2ry 



= 0, 



or simplifying 



X y 



Similarly the equation of L^L^ is 

y X 

That is, these are the regression lines. 

We have said that any horizontal section of a normal correlation 
surface is an ellipse. Any vertical section, on the other hand, is a normal 
probability curve. Consider first a vertical section parallel to the a;-axis. 
Write y = cin equation (4). 

After a Uttle simple algebraical arrangement this then reduces to the 
form g2 . 

This is a normal curve, with area 



Its centre is at a; = rcaja^, y = c, i.e. on the regression Une. Its 
standard deviation is (jj^-\/{l — r^) and is independent of c. 

Similar statements hold for a vertical section along a fine x = c', 
the constant standard deviation being crgVil "" ^^)- 

A similar procedure will show, with rather more cumbrous algebra, 
that any vertical section is a probabihty curve. 

(5) OTHER METHODS OF DETERMINING CORRELATION 
1. Fourfold Table. 





'h 


X2 




Vi 


a 


b 


a + b 


Vi 


c 


d 


c + d 




a+c 


b + d 


N 
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(a) When the divisions pass through the means of both characters, 

TT (a — b) 



r = sm 



2(a + b)' 



This formula (Sheppard's) is of little practical use, since the mean 
values, in cases where the fourfold table is the only method which can 
be used, are generally unknown. 

(6) In cases where the dividing Unes do not pass through the means 
(means not known) the equations for determining the correlation are : 



and 




a 



d 



Fig. 22 



( g + c) - (6 + d) /2 fi 



dx 



N (the total no 

(the cross-hatched area on the upper curve) 



■ea on the sid 
from which h and k can be obtained by Sheppard's Tables; and finally 



(a + 6)^(c + ^) ^^,^^_^.^^ 
(the cross-hatched area on the side curve) 
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ad — he 

Whk 



= r + -^U + ^ (h^ - \) {h^ - \) + J h {h^ - 2.) k {B - 3) 



6 



r5 



where 



+ J20 ('^^ - 6^' + 3) (^ - 6F + 3) 

+ ^h{¥- 10^2 + 15) h{¥~ lOP H- 15) 

+ i (^* - ISA* + 45A2 - 15) (*« - 15F + 45F - 15) 
5040 ' 

+ etc., 

V'2m V2^ 



In obtaining these equations, a normal correlation surface is assumed, 
with the equation : 



N 



1 1 
2 



1 /x^ 2rxy y^\ 



The probable error of r obtained by the fourfold table method is 
much larger than that given by the formula of p. 113. The correct 
formula is too complicated to insert heref. 

2. Method of ContingencyJ. 

The following is an example of a contingency table §. 

Tathers 





Merry 


Melanqholy 


Alternating 


Even 


Totals 


Meny 
Melancholy 
Alternating 
Even 


122 
10 
70 

58 


8 
2 
9 
6 


81 

7 

101 

66 


67 
10 
68 
45 


278 

29 

248 

175 


Totals ... 


260 


25 


255 


190 


730 



* AU this is rendered much easier by P. F. Everitt's "Tables of Tetrachoric Functions," 
Biomelrika, vn. pp. 437 — 451, or Table XXIX in Pearson's Tables. 

j- Unless the dichotomies are extreme, good values are obtained from the use of 
Tables XXIII and XXIV, Pearson's Tables. 

J Karl Pearson, "On the Theory of Contingency and its Relation to Association and 
Normal Correlation," Drapers' Company Research Memoirs, Biometric Series, i. 1904; 
Dnlau and Co., London. 

§ Taken from a paper by E. Schuster and E. M. Elderton on "The Inheritance of 
Psychical Characters (being a further Statistical Treatment of Material Collected and 
Analysed by Messrs G. Heymans and E. Wiersma)," Biometriha, 1906 — 1907, v. pp. 
460—469. 
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The method is employed when the grouping is merely by class and 
the different classes have no known relation to one another — in other 
words, when the grouping is merely quaUtative. The order of the 
different quaUties can be changed without making any difference to 
this method. 

The relationship between the two variables is measured by the 
differences between the numbers actually found in the various com- 
partments of the table, and the numbers that might be expected there 
by pure chance. 

To state the rule : 

The total mean square contingency, ^*, of the table is given by 









where n,, = total frequency in ^th row, 

Wj = total frequency in qth column, 

Wj^ = frequency of constituent common to pth row and gth column, 
N = total number of cases in the table. 
Then the coefficient of mean square contingency (7^ is: 

If it is assumed that a normal distribution underhes the classification, 
and if the fineness of grouping is right, then the coefficient C^ is nu- 
merically equal to the correlation coefficient r. 

In the above case, C^ = 0-16. 

The probable error of C^ is very comphcatedf . 

For Ci = 0, P.E. =. -QIU^IVN. 

Instead of calculating the mean square contingency, it is easier though 
not so accurate to calculate the mean contingency. Each quantity 

n -"^ 

* There are certain corrections to 4)", not mentioned here, which often make con- 
siderable difEerence to the result. See K. Pearson, F.R.S., "On the Influence of Broad 
Categories on Correlation," Biometrika, 1913, rx. pp. U6 — 139. The important point is to 
have fine enough grouping, but not so fine as to leave cells with very few or no cases 
in them. 

■)■ J. Blakeman and Karl Pearson, "On the Probable Error of Mean Square Contin- 
gency," Biometrika, 1906, V. pp. 191 — 197. A. W. Young and Karl Pearson, "On the 
Probable Error of a Coefficient of Contingency without Approximations," Biometrika, 
1916, XI. pp. 215—230. 
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is called a subcontingency, and it will be observed that in the formula 
for 02 these were squared. Instead of this, let us, without squaring 
them, add the positive subcontingencies only (for of course the sum of 
the whole is zero), and write 



N*l' = S,,\n,,-'^y 



From ifi, by using Table XXXIV of Pearson's Tables, a value of 
the second coefficient of contingency C^ is read ofE, which, under condi- 
tions similar to those outlined above, also is equivalent to r. 

The contingency method, it must again be emphasised, gives these 
two measures of the connection or association of the qualities considered 
even without any assumption that a continuous variation underHes 
the discrete classification. If however such is assumed, then the approach 
to equahty of C^ and Cg will be a good measure of the normahty of the 
distribution and the suitability as to smallness of our elements of 
grouping. With very fine grouping we get into difficulties owing to having 
to record by units only. 16 to 25 subgroups is a good range. 

It is interesting to find that areas of positive contingency are sepa- 
rated from areas of negative contingency on a normal surface by a 
hyperbola having a simple relationship with the contour elhpses. 



3. Two-row Table*. 

This method gives a unique value of r in the case of two variates 
one of which is both quantitative and continuous (e.g. intelUgence), 
while the other, though quantitative, admits of only two subdivisions 
(e.g. into good and bad visuaUsers), or, in more technical language, is 
"alternative." 

-»- X (intelligence) 























Good 
visualisers 




















Bad 

Tisualisers 






































N 



* Karl Pearson, F.R.S., Biometriha, 1909, vn. p. 97. 
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The assumptions made are two in number : 

(1) that the regression is linear: 

(2) that the distribution of the alternative variate is approxi- 
mately normal or Gaussian. 

The accompanying diagram (Fig. 23) will make the method clearer. 
LL' is the regression hne of x on y, its equation being 

X — X 

-_Zl_ 



r = 



y-y 



X 



Good 
visualisers . 



Bad 
yisnalisers 



L \ 

-/,■ >\ 



^- 



if 



^N 









ST 



-1M 



Fig. 23 

Putting X, the mean of all the a;'s, =f, x = p', the mean value of 
the array of x'a corresponding to the smaller group of y's (good visual- 
isers), and y — y = 1, the distance of the centroid of the area RST from 
the mean MM of the Gaussian curve, we have the formula : 

p-v' 



The numerator of this is known, and the denominator 

1 JXa 



2 C,^ 2,^3. 



(72 ffg X area of part RST 
nt V2Tr 
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where Wj = no. of cases in upper group (area RST), 

and N = total no. of cases. 

In this expression ^ = J (1 + a) in Sheppard's Tables * from which the 

cc 
corresponding value of — can be read; and the value of z corresponding 
fa 

to this latter result gives 



"2 W 



/ = 



V27r 

P-P' 

0-1 




the value of z in Sheppard's Tables corresponding 



to the J (1 - a) of ^ 



) 



The above method might advantageously be used in determining 
the relationship of two characters, such as those suggested (intelhgence 
and visual imagery), which are both quantitative, but of which one 
only admits of reUable division into two groups. It is the smaller of 
these groups (the "tail" of the distribution) which gives the n of the 
formula. In the suggested case, it might be represented by the number 
of individuals, out of the entire group N measured, who successfully 
pass certain somewhat difficult tests of visual imagery. 

In cases where the aj-variate (continuous and quantitative in our 
example above) can only be divided into classes, showing no definite 
order or quantitative relations to one another, the y-variate being again 
quantitative and assumed to follow a normal distribution, but alter- 
native, a modification of the above method gives rj. 

4. Short METHODsf . 

(i) It can easily be shown that 



and therefore that 



2cra,CT5, 



(ii) If the distributions are both normal, and if both variates have 
the same meaning, and the same standard deviation a, then 



r= 1- 



iVV 



* Biometrilca, n. p. 182; Table II of Pearson's Tables. 

■f Karl Pearson, " On Further Methods of Determining Correlation," Drapers^ Company 
Research Memoirs, Biometrio Series, iv. 1907. 
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where S is the sum of the positive differences only. This method might 
sometimes be conveniently used in determining the "individual" corre- 
lation between performances of the same individuals in the same mental 
tests on different occasions. 

5. The Method op Ranks. 

Some years ago the valuable suggestion was made by Professor 
C. Spearman* that measurements of psychical performance may con- 
veniently — nay, preferably — be replaced by the numbers representing the 
rank or order of merit of the individuals in the group. On this basis 

S (xy) 
the ordinary product-monient formula for r, ^ — — , can be easily shown 

to reduce to the form 00/ \2 

P ^ N(m~l) ^"■'' 

where v^ and i/^ are the ranks of an individual in the two series. 

Professor Spearman also suggested a still simpler formula, which he 
calls a "foot-rule" formula. It is 

«=l-|^iV^) (^)' 

where S (g) denotes the sum of the "gains" in rank (sum of positive 
differences) of the second series on the first, and then empirically, by 
noting the distribution of a large number of chance values of S (g), 

'■ = sin(|i?j (y). 

This method of using ranks, and the formulae suggested therefor, 
were vigorously criticised by Karl Pearson in the paper quoted on the 
preceding pagef. 

Some form of frequency distribution must be assumed, and the 
"foot-rule" method assumes that form to be a rectangle. On the assump- 
tion of normal distribution, Professor Pearson shows that 

r = 2singp) (8), 

where p has the value given above. 

* C. Spearman: "Measurement of Association between Two Things," Am. Joum. 
Psychol. 1904, xv; "'Foot-rule' for Measuring Correlation," Brit. Joum. of Psychology, n. 
Pt. I, July 1906. Were it possible to keep B in its place merely as a "foot-rule" whose 
"chief mission is to gain quickly an approximate valuation of r," it would not be harmful. 
But the ease of its calculation leads to its use too frequently "not merely for assay purposes 
as originally contemplated, but even sometimes for research" (see C. Spearman, Brit. 
Journ. of Psychol. 1910, ni. p. 286). 

t Professor Spearman endeavours to meet some of these criticisms in Brit. Joum. 
ofPsycliol. 1910, III. p. 271 sqq. 
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In terms of the sum of positive differences of ranks ("gains" in rank) 
the formula is 

r = 2cos2.{^^}-l 

= 2cos^(l-i2)-l (e). 

This agrees very closely with Spearman's formula (y), and, having 

a definite theoretical basis, should now take its place. 

Professor Pearson has proved that the probable error of p for zero 

•67449 

correlation = , . He is thus able to show that, for zero correlation, 

Vn—l 

f J J J x -7063 

p.E. 01 r deduced from p = , , 

and P.E. „ r „ „ 22=4EEL. 

Vn — 1 

While this book is going through the press, Professor Spearman has 
very kindly sent on to us the proof sheets of portions of a work on 
correlation by Professor Wirth of Leipzig, in which considerable space 
is devoted to the method of ranks, of which the author highly approves 
and which he ingeniously develops in several ways. Although we cannot 
completely share this approval, we can appreciate the thorough exposi- 
tion given and especially the clear distinction drawn between (1) the 
stage of complete measurement, (2) the stage of ranks in which classes 
are arranged in order, and (3) the stage of contingency, in which classes 
of X can be assigned to classes of y, without however being in any order 
of magnitude. 

This shows clearly the position which rank methods take up. The 
controversy concerning these methods really falls into two parts, 
(1) questions of principle and (2) questions of mathematical detail, 
often of primary importance. Leaving the latter aside however, let us 
consider Professor Spearman's claim that ranking is superior to measure- 
ment in psychology because the things measured are not "homogeneous," 
for example the skin's "spatial sense" and its sensitivity to pain*. In 
short. Professor Spearman raises the whole question of mental abilities 
being "magnitudes" or true "quantities" (see Chap. I, p. 11). 

We would reply that there is much truth in what he says. But 
(1) for practical purposes the correlations of the physical signs of mental 
states are themselves often important, whether or no they are the true 
* Brit. Journ. of Psychol. 1906, n. p. 91. 

9—2 
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correlations of the mental states as such, and (2) the physical measure- 
ment often, at least, goes further than giving the mere order of merit 
of the mental states. It frequently ranks the differences also (cf. p. 12). 
Indeed, in many cases, such as the giving of examination marks, there 
is a definite attempt to do more than say that this is the order of merit 
of the competitors — there is an attempt to say something like : — " This 
candidate is easily first, the second and third are close together, the 
fourth a long way behind," and so on. While we are prepared to agree 
that actual marks, in an examination or a test, are hardly likely to be 
true measures of the desired mental state, yet we claim that they show 
often a good deal more than mere ranks. Mental measurement need 
not remain at the second of Professor Wirth's levels, even though it 
may be long before it can reach the first. 

(6) CORRELATION OF SUMS OR DIFFERENCES* 
This article is concerned with the following problem. After calcu- 
lating the correlations between several series of values, it frequently 
happens that we want the correlations given by some of the series added 
together; and differences are not less important than sums. The corre- 
lation of the pool is not the mean of the correlations. The general problem 
is as follows. 

Let the two series of values be denoted by a^, osg ... a,,, and 
6i, 62 ••■ \> sach being measured from its own mean and consisting of 
N cases. Let these variates be multiphed by constants or weights. 
Required the correlation between 

A = njcii + n^a^ + ... + n^jCSj,, 
and B = m^bi + m^2. + ••• + m^h^. 

Since all the a's and 6's are measured from their means, it is clear that 
A and B are also so measured. The required correlation is therefore 

S' (AB) 
*" ~ VS' {A^) VS' (B^) ' 
where the symbol S' indicates summation from 1 to N. We shall retain 
the symbol S, on the other hand, for summation from 1 to ja or 1 to q. 
Consider now the correlation of any particular a with any particular 6. 
It is given by ^ ^ ^^^ 

Naaffi ' 
That is, Na„a,r^, = S' (ab). 

* C. Spearman, Brit. Journ. of Psychol. 1913, v, p. 417. 
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Multiplying the a by its constant n, and the 6 by its constant m, only 
alters the standard deviations of these quantities, not their means, 
since they are already measured from means. We have therefore 

'Nnmajj^raii = S' (na.mb), 

and summing this over the p and q measurements of a and b we get 

NS (nmaa'or^rai) = SS' {na.mb). 

Now the right-hand side of this equation means the sum of all pos- 
sible products of na and mb. But a httle consideration will show that 
this is exactly what the numerator of r, viz. *S' (AB), means. Therefore 

S' (AB) = NS (wmc7„a,r„,). 

The two quantities in the denominator of r are found similarly, or by 
putting A == B in the expression just arrived at, and we have finally 

^^ S (wmc7„g^r„i,) 

^{S {nnaaOaraa) S [mma^a^rti)} ' 

wherein it must be noted carefully that the summations in the de- 
nominator include the correlations of an a or of a 6 with itself, corre- 
lations which of course are unity. Excluding these cases from the 
summations, and recognising that each case like a^at has a twin case afOg, 
we arrive at the following form of the same formula : 

^ S {nmagajr^) 

*■ V{S (wV) + 25 (wn<T„or„r J} V{'8 (mV) + 2/S {mma^a^r^)} ' 

In the form in which this formula will perhaps most frequently be 
used, p will equal q, and both will equal 2, while the weights n and m 
will all be unity. We then have 

The immediate practical uses of the general formula are perhaps not 
so important as the theoretical uses to which Professor Spearman has 
applied it. 



CHAPTER VII 

THE INFLUENCE OF SELECTION 

Influence of mild selection on <t and r — Rigorous selection and partial correlation — 
Three correlated variables represented by dice throws — Multiple correlation — 
Spurious correlation — Variate difference correlation method. 

(1) THE INFLUENCE OF MILD SELECTION 

The essential point about the whole theory of correlation is that it tells 
us how a group of individuals selected from the general population 
according to some characteristic (say as being within certain hmits of 
height, or possessing some mental abihty or manual dexterity in a high 
degree) will also differ from the general population in other charac- 
teristics. 

The ordinary correlation coefficient already tells us much in this 
respect. For example, if the correlation between two abiUties, say 
(1) the abihty, whatever it may be, which is measured by Dr McDougall's 
Dotting Machine and (2) the abihty to memorise Nonsense Syllables 
according to certain experimental regulations, be known to be -4 for 
the whole population, this means that if a group be selected with 
"Dotting" abihty equal to x (measured from the general mean in 
a units) then this group will most probably have an average " Nonsense 
Syllable" abihty equal to -^x (measured in a similar way). 

Clearly in practice we do not usually know the means and the 
correlation for the whole population but only for samples. We take 
large samples and endeavour to ensure that they are random and not 
selected samples from the population we wish to investigate. 

The selection contemplated in the above example is very rigorous: 
all the individuals are presumed ahke in regard to "Dotting" abihty. 
In practice such a rigorous selection never takes place. The boys in 
a school form, for instance, are more alike in say abihty in Latin than 
the general population, yet not absolutely alike. The "scatter" of this 
variate (Latin) has been reduced, yet not to zero. 

Just as selecting a group of individuals for one variate will alter 
the average value of other variates, so it will alter the scatter of these 
other variates, and their intercorrelations. It is this phenomenon which 



PT. II, CH. VII] THE INFLUENCE OF SELECTION 135 

in an extreme form gives us what we already know as "partial corre- 
lation" (seep. 105). 

In fact, selecting a group of individuals within certain limits of a 
quality A implies an indirect and less rigorous but frequently very 
important selection of the other quahties B, C, ... of these individuals 
and of their intercorrelations. Consider the simplest case of three organs, 
A being directly, B and C only indirectly selected. Let subscripts 1, 2 
and 3 refer to A, B and C respectively, and let the standard deviations 
and correlations in the general population be a^, a^, a^, r^^^ '"'w ^''^^ ''si- 
In the selected group o-^ is reduced by the selection to s^, and a^ and a^ 
are indirectly altered to Sg and Sg, r-^^, r^^, r^^ to r^g, Igs, Tsi- Then the 
following formulae enable these quantities to be calculated*. 

Write Sj/ffj = cos xi- 

Then ^a/ffg = sin %2, and Sg/tTg = sin a^, 

where cos a^^ = I'n sin xi, and cos a^^ = r^g sin ^i- 

Further ri2 = cot Xi cot a^g, 

Tis = cot xi cot aj3, 

*'23 ~ COS aj2 COS aj3 
^ sin ai2 sin a^j 

For example let us suppose for the moment that the correlations 
between (1) Classics, (2) Drawing, and (3) Enghsh have, in the general 
population of Enghsh Fourth Form boys, the values found on pp. 104 
and 105, viz. 

ri2 = -42, 

^23 = -21, 

J-si = -78, 

and let us further suppose that, say on a standardised percentage system 

of marking, the standard deviations of the marks in these subjects are 

CTi = 16, 
CT2 = 13, 
a, = 14. 

Now suppose a mild selection of Fourth Form boys to be made on 
the basis of their abihty in Classics, and in the selected group let us 
suppose that the standard deviation in marks in Classics is reduced to 

Si = 12. 

* See Karl Pearson, F.R.S., "On the Influence of Natural Selection on the Variability 
and Correlation of Organs," Phil. Trans. 1902, cc. A, pp. 1 — 66, where an interpretation 
in terms of spherical trigonometry is given. 
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We then have 



Further 
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cos xi = sjoi = 12/16 = 0-75, 

X = 41° 25', 
sin x = 0-66, 
cotx'= 1-14. 

cos a^2 = '"12 sin Xi = "42 x -66 = -28, 

ai2 = 73° 44', sin a^^ = '96, 

cot ai2 = "29 ; 



cos ai3 = ri3 sin xi 
aj^ = 59° 0', 



•78 X -66 = -51, 
sin a^j = -86, 
cot a^g = '60. 



Sj = aa sin a^^ = 13 x -96 = 12-5, 
^3 = o's sin fflis = 14 X -86 = 12; 
Tia = cot xi cot %2 = 1"14 X -29 = -33, 
lis = cotxi cot a^a = 1-14 x -60 = -68, 

•21 - -28 X -51 



1^23 — 



r,3— cosaigcosaj. 
sin %2 sin ajg 



•96 X ^86 



•12. 
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Putting|these results together for comparison we obtain the following 
tables : 

Standard Deviations 



Mild selection Bigorous selection 
Before selection in Classics in Classics* 



Classics 
Drawing 



16 
13 
14 



12 

12-5 

12 




11-8 

8^7 



Classics and Drawing 
Classics and English 
Drawing and English 



Correlations 

Mild selection Bigorous selection 
Before selection in Classics in Classics* 



■42 
•78 
•21 



•33 

•68 
•12 



•21 



This mild selection for Classics has therefore left the scatter of abiUty 
in Drawing almost untouched, but has made the group somewhat more 
homogeneous than it was in Enghsh. The intercorrelations are all 
slightly reduced, that between Drawing and Enghsh being now almost 
nil. 

* See later. 
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(2) RIGOEOUS SELECTION AND PARTIAL CORRELATION 

If we suppose the selection in Classics to be absolutely rigorous, so 
that the resulting group is absolutely homogeneous in abihty in that 
subject, then our formulae simphfy considerably and we are left with 

si = 0, 

.-. Xi = 90°, 

sin xi = 1= 
cos ai2 = ri2, 
cosai3 = r^; 

:. 22 = 072^/(1 -'•12^), 

Txa is meaningless, the variate 1 being _^a!e«i, and similarly r^; 



" 3.1' 



'12'13 _ _ 



'' V(l-ri2')V(l-»-i3') 
This formula was reached by Mr Udny Yule* before the more general 
formulae were known, and is called the "partial" correlation of variates 
2 and 3 for a constant value of 1, and is written r^_^. Mr Yule obtained 
its value by applying to three variables the methods we have already, 
following him, employed on p. 108 for two, using the Method of Least 
Squares. Let the regression equation of z/ on z and x be 

2/ = 623 z + 621a;. 

This gives the normal equations for 633 ^'^^ ^21 

whence, after a Uttle simphfication, 

■L _ ^23 ~ ^21^31 "^2 
"23 — 1 _ „ 2 „ ' 

and a similar expression for 621 • Erom the regression of z on y and x 
we^get in like fashion 

, _ y23 - ^21 ^31 ^3 

and rg3 X = either 623 or 632 when 0-2 = "'3 = unity, so that 

^23 ~ ^21^31 



V(i-V)V(i-V) 



* G. Udny Yule, "On the Significance of Bravais' Formulae for Skew Correlation," 
Proc. Boy. 8oc. 1896, LX. pp. 477 — 489. 
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If we apply these formulae to our three variables (1) Classics, 
(2) Drawing, and (3) English, we shall get the standard deviations and 
correlations for a rigorous selection in Classics, given in the third column 
of the above tables. We see that the group is still heterogeneous in 
Drawing, but a good deal more homogeneous in Enghsh. The correla- 
tion between Drawing and Enghsh has now actually been reversed. 
Needless to say, the actual numbers in this example are not to be taken 
as giving the facts, being only used for the sake of illustrating the method. 

It is important to reahse, and becomes clear from the above con- 
siderations, that all correlations are partial correlations, inasmuch as 
there is always a selection of the group we are working with, for age, 
or race, or social standing, or what not. Indeed even the whole Uving 
population is only a group, surviving from "what might have been," 
by natural selection. This wide point of view will save us from many 
of the errors into which we are apt to fall in handling correlation 
coefficients. 

It is particularly tempting to draw what are usually fallacious con- 
clusions from the comparison of "entire" and "partial" correlations 
as to the underlying factors at work causing the correlations. For 
instance, in the present case one might be tempted to conclude that the 
original positive correlation between English and Drawing was entirely 
due to factors which these share with' Classics, that is, to a general 
factor, and that any direct connection of these two subjects is of an 
"interference" nature. But such conclusions as to the underlying 
mechanism have to be made, if at all, with great reserve, as will be seen 
from examining cases where we have independent and first-hand know- 
ledge of the factors at work, as we have for example in dice throwing. 
A correlation can be set up between two dice throws of m and n dice 
respectively by leaving some of the m dice lying to form part of the 
second throw. 

(3) THREE CORRELATED VARIABLES REPRESENTED BY 
DICE THROWS* 

Let n red dice, n blue, n yellow, and n white dice be thrown, and let 
the variable x be given by the combined red and white, y by the com- 
bined yellow and white, and z by the combined blue and white scores, 
as in Fig. 24. 

That is to say, there is a general factor (the white dice), common 

' This section is an extract from an article by Godfrey H. Thomson, Brit. Joum. 
of Psychol. 1919, IX. p. 323 et seq. 
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to all three variables, which causes all the correlations between them. 
These correlations are 



'XV ' 



yz 'zx 

Red 



'White 




r 

Blue 


Fig. 24 


Yellow 


Purple 
n4- 


X 


Orange 
>» 




Fig. 25 
The partial correlations are, by the well-known formula. 



Tx^.z = ry.-x = r^x.v = (i - i • 4)/V(l - iW) (1 - ii?) = i 
It is not permissible, however, to reverse this statement, and to assume 
that in every case where r^ = r^^ = 'rzx= \ the correlations are formed 
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solely by the action of a general factor. In fact, identically the same 
values can be produced without any general factor at all. Let n purple, 
n green, and n orange coloured dice be thrown, and let the variable x 
consist of the purple and orange, y of the orange and green, and z of the 
green and purple scores combined, as in Fig. 25. 

Here there is no general factor whatever. The connection of x with y 
(through the orange dice) is entirely independent of the connection of x 
with 2 (through the purple dice). 

Yet the correlations, both partial and entire, are exactly the same as 
in the first arrangement, viz. 



'OYll . Z ' -UZ . 3! ' Z£ 



_ 1 

~ 2 



Clearly, therefore, if we only know of three variables x, y, and z, 
formed of dice throws, that their correlations are as above, we cannot 
say with certainty whether a general factor exists or not. Let us now 
consider a more general arrangement of dice, with numbers of different 
colours, viz. W white, R red, B blue, Y yellow, P purple, G green, 
and orange, thus : 

X 



-W. 



Fig. 26 



X consisting of the scores of the W, R, P, 0; y of the W, Y, 0, G, and 
z of the W, B, G, P dice. 

In this arrangement we shall call W a general factor, it being common 
to all three variables; R, Y, and B specific factors, they being unique to 
X, y and z respectively ; and 0, G and P group factors, since each runs 
through a group of (here two) variables. 
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The theoretical values of the correlation between any two of the 
variables, say x and y, can be found by means of the formula* 
Number of dice common to x and w 

f = •■! 

■^^ Geometrical mean of total dice in x and in y ' 
For example, if x is the score of 9»i dice, and y the score of 4w dice, 
In being common, the correlation is 

'" \ = 0-33. 



V9w X 4:Ji 6 

The general arrangement of dice shown in the above figure includes the 
two special cases already considered, viz. 

(1) p = o=^o = zero, R = B^Y==W = n; 
and (2) P = G! = = w, R = B = Y =W = zexo; 

which both give r^„ = Ty^ = r^^ = if- 

Of the infinite arrangements possible with this diagram, an infinite 
number (of a lower order) can in general be constructed to produce any 
given set of positive correlations between x, y and zj. Moreover, all 
these possible ways of producing the required correlations are, in our 
ignorance, equally hkely to have been those used by the person making 
the arrangement of dice, although they are, it is true, not equally 
probable as chance occurrences. From the correlations, therefore, we 
cannot in general deduce what proportion the white dice (i.e. the par- 
ticular colour representing the general factor) bears to the others, for 
this proportion can vary between wide hmits, and give exactly the same 
correlations. The most we could conceivably do would be to give the 

* This formula was proved by me mBrit. Journ. of Psychol. 1916, vm. p. 275, in ignorance 
of any former statement of it. Professor Spearman showed {ibid. p. 282) that it is deduoible 
from a formula of his concerning correlation of sums or differences. I have since noticed 
that Professor Spearman gives the following clear expression of the formula {Am. Journ. of 
Psychol. 1904, sv. p. 75) : " The correlation is always the geometrical mean between the two 
shares." I do not however agree with the application he there proceeds to make. The 
formula can also be directly deduced from Bravais, and in several other ways. It assumes 
that the elements all have the same standard deviation, as dice have. Gr.H.T. 

t If he be so minded, the reader can make thousands of other dice patterns, all giving 
the correlations r^y=r.ijz='^iix=i- 0°^ group is given by 

R=B=Y=W=kn, P=G=0=sn. 
But such a high degree of symmetry is unnecessary. For example, another pattern giving 
the same correlations is 

S=91n, T=91n, B = 5in, P=78re, 0=18n, 0=91re, W = 78n. 
The number of white dice present ranges between the two extreme oases given in the text. 

J A convenient method of finding such arrangements or patterns to give specified 
values of the correlation coefficients is explained by J. Ridley Thompson, Brit. Journ. 
of Psychol. 1919, x. p. 98. 
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"expectation" of the proportion of white dice. The meaning of this 
would be, that if a very large number of arrangements of dice were 
examined, each giving the required set of correlation coefficients, and 
if we assume thai these arrangements of dice are not formed on any flan, 
beyond that they all agree in the correlations they produce, then the 
average proportion of white dice would be that named as the "expec- 
tation " thereof. But if there is any reason to think that the large number 
of cases examined are all of much the same pattern — as there would be 
were they all natural phenomena of the same sort — then the " expecta- 
tion " of the proportion of white dice, becomes useless and meaningless. 
We cannot conclude anything which is of any definite value in con- 
structing the pattern, except give limits within which it must he. 

This brings us to the problem : — Are there any values of r^, r^^ and 
r^j, which make it certain (having regard to their probable errors) that 
at any rate som£, general factor, some number of white dice, exists? The 
answer is that this is so if the correlations are large enough. For example, 
if we take the special case of equahty of the three coefficients, 

'aw '1/2 'zx ' J 

then up to r = ^ the correlations can be imitated by various numbers of 
red, blue, yellow, purple, green and orange dice without any white dice. 
But as soon as the common value r rises above \, some white dice are 
necessary. In this case therefore the proof of the existence of (at any 
rate) some amount of general factor reduces to the examination of the 
probable error of r, to see if r is indisputably greater than \. 

In the more general case the matter is not so simple, the three values 
of r differing from one another. The more detailed examination of this 
case is reserved for treatment elsewhere. It will be found however that 
if the quantity 

is indisputably greater than unity, then some white dice, some general 
factor, may be postulated with certainty*. It may not be out of place 
to remind ourselves again that this, though true of the arrangements of 
dice we are considering, may not be true in the same sense of other 
phenomena, e.g. biological or mental phenomena. 

The following two examples illustrate the above principles. 

* A rough guide is the average, value of the three r's: if this is greater than J some 
general factor certainly exists. The exact condition however is that given in the text. 
It is due, in this form, to Mr J. R. Thompson, see Brit. Jtmm. of Psychol. 1919, ix. p. 335. 
Note that all this only applies to correlations produced by overlapping dice throws, or by 
some sufficiently similar mechanism. 
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Example A 

Three variables, composed of overlapping dice throws, give corre- 
lations as follow: 

r^„ = 0-32, r„, = 0-33, r,^ = 0-54. 
Are any dice common to the three variables? 
In this case we find 

r^^ + fy? + r^^ + 'Ir^ryjr,^ = 0-56, i.e. < 1. 
From this we conclude that these correlations can be imitated either 
with or without a general factor of white dice. The following arrange- 
ments of dice do actually produce these correlations. 

Case (1). R = 19ji, B = lln, Y = 85»i, (specific factors), 
P = G = = zero, (no group factors), 

W = 21n, (a general factor). 

Case (2). R = 15m, B = 15n, Y = 16w, (specific factors), 
P = iln, G = 25n, = 24w, (group factors), 
W = zero, (no general factor). 

Example B 

Three variables, composed of overlapping dice throws, give corre- 
lations as follow: 

r^y = 0-72, r^, = 0-77, r,^ = 0-67. 

Are any dice common to the three variables? 
In this case we have 

ra>y^ + U^ + fJ- + 2r^jr^,»-,^ = 1-93, i.e. > 1. 
We conclude therefore that some white dice common to the three 
variables are present, i.e. that there is a general factor. The following 
arrangements of dice do actually produce these correlations, the general 
factor being a minimum in one and a maximum in the other. 

Case (1). R = 156w, B = lOin, Y = 52w, (specific factors), 
P = G = = zero, (no group factors), 

W = 260w, (general factor). 

Case (2). R = B = Y = zero, (no specific factors), 

P = 90»i, G = 161w, = 123w, (group factors), 
W = 198w, (general factor). 

We see then that if 

the presence of some white dice is certain. If the above quantity, which 
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we shall call D, is equal to or less than unity, the presence of white dice 
is uncertain. Suppose we consider two cases in which 

r^ = 0-8, r,, = 04, r,^ = 0-1, Z) = 0-842, 
and ra^ = 0-2, rj,^ = 0-2, r^^ = 0-l, D = 0-094, respectively. 

Can we in these two cases say anything as to the 'probability of the 
existence of some general factor? 

The answer to this question is twofold. If we suppose that the person 
making the arrangements of dice has, among all the possible arrange- 
ments giving 

r^ = 0-8, r„, = 0-4, r,, = 0-1, Z) = 0-842, 
chosen one by chance selection, and if we suppose that the other 
arrangement giving 

r^ = 0-2, r„, = 0-2, r,^ = 0-1, 2) = 0-094 
has similarly been chosen by chance selection, then it is much more 
probable that a general factor exists in the first than in the second case. 
This probabiHty will in fact rise and fall with B though it is not measured 

byZ>. 

But if the person making the arrangements of dice has any definite 
rules which he follows in making the patterns, then the above probabiHty 
will have much less meaning. In the former instance it would express 
the average to be found in many cases; in the latter it will no longer 
express even such an average value, and will have no practical worth. 

Before leaving for the present the subject of dice throws two points 
may be mentioned. (1) Negative correlations may be imitated by dice 
being added to one, but subtracted from the other, variable*- (2) There 
are many ways conceivable in which correlations can be produced other 
than by common factors. Consider for example the positive correlation 
between the number of hearts in my hand and the number of spades 
in my .partner's, at whist. Doubtless the correlations found in mental 
phenomena are in many instances still more subtle in origin. 

(4) MULTIPLE CORRELATION 

The full significance of correlation is only to be realised after a careful 

study of the general theory of correlation of numerous variables, of 

which the correlation of two variables, measured by the correlation 

coefficient r, is only a particular case. The classic memoir on the subject 

is that by Professor Karl Pearson on "Regression, Panmixia and 

* See J. R. Thompson, "The R61e of Interference Factors in Producing Correlation," 
Brit. Journ. of Psychol. 1919, x. pp. 81 — 100. 
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Heredity," in 1896*, which is however too advanced for quotation at 
any length here. In 1907 Mr G. Udny Yule introduced a new notation^ 
and made various improvements, and his formulae will now be briefly 
summarised. 

The regression equation for the two variates, x and y, both measured 
from their means, was found in a former section to be of the form 

the equation being obtained by an apphcation of the method of least 
squares. 

It is evident that the method thus applied to two variates might, 
without further assumptions, be used in the general case of n variates. 
Thus, if Xj, x^, ... Xn denote deviations from means, the equation ex- 
pressing the regression of Xion x^ ■■■ x„ could be written 

^1 ~ "12.34"-K*2 + "l3.24...«^3 + ••• + "l«.23."«-A- 

" In this notation, the suffix of each regression coefficient completely 
defines it. The first subscript gives the dependent variable, the second 
the variable of which the given regression is the coefficient, and the 
subscripts after the period show the remaining independent variables 
which enter into the equation" (Yule, op. cit. p. 182). 

In terms of the same notation, write 

'"l2.34-"« ~ ("X2.34---«-"21.34---«) ' 
•^1.23 — n = ^1— ("12.34"-«'^2 "I" "• "Im .23 ■■• n-l-^n); 
-"*'' 1.23— n = ^ {^ 1.23— «)• 

Then the normal equations, from which the regressions are deter- 
mined by the method of least squares, may be written 

O (a'2''^1.23— «) ~ '^ ("''3-"'^1.23— tc) =••■=»-> (*m-^1.23 — m) ~ 0- 

From these equations Mr Yule has deduced the following results : 

"12.34 — re — '12.34 — ra „ ' 

"2. 34 -..71 

<'■ 1.23---71 = °'l (1 ~" ^"12 ) (1 ~ *" 13.2) (1 "~ >■ 14.23) ••• (1 ~ >■ lK.23-..n-l)> 
_ ^12.34 — «-l ~ *"!?!. 34-. K-1 ^2m. 34 — M-1 
'■l2.34-« )i(l-r^ \i' 

'"12.34 — « is known as a "partial" correlation coefficient, being the value 
of the correlation between 1 and 2 for constant values of 3, 4 ... w. 
Similarly, &i2,34...n is a "partial" regression coefficient (cf. "partial 

* Phil. Trans, oxcvm. A, pp. 443 — 4591 

t Proc. Soy. Soc. 1907, lxxix. A, pp. 182—193. 

B. & T. 10 
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differentiation" in the Differential Calculus). Knowing the "total" 
correlations r^^, »"i3, r^a, etc., we are enabled to obtain the various partial 
coefficientsby successive substitutions. Thus, in the case of three variables, 
1, 2, 3, we have 



rr 



- r.-r. 



13 '23 



\/l-r,„2\/l 



'13 " ^ '23 

and two similar equations, expressing the value of the correlation be- 
tween two of the variables for a constant value of the third*. If a 
fourth variable be added, we have the further set of equations 



^19. .ad — 



(l-'-\4.3)*(l-'-'24.3)*' 

Perhaps a more convenient formula for obtaining the partial corre- 
lation of two variables for constant values of a third and fourth is 

J. 1'l2 (1 - V) - yi3 (»'23 - y24>'34) " ^14 (^24 - ^23''34) _ 

When the partial correlation coefficients have been determined, the 
regressions can be found by substituting in the appropriate equations, 
and give at once the regression equations. As explained before, a 
regression equation gives the most probable value of one variable for 
given values of the remaining variables, the standard error in such a 
prediction being <Tj_23...n> ^tc. 

An example of the method of applying the above formulae is given 
in the next few paragraphs, where the partial correlations, regressions, 
etc. in the case of four interrelated psychical capacities are worked out 
on hues identical with those illustrated by Mr Yule in his paper. 

Example of Multiple Correlation f 
(Boys, ages 11 — 12; n = 66.) 

1. Crossing through two letters (e and r). 

2. Crossing through four letters (a, n, o, s). 

3. Combination test. 

4. Mechanical memory test. 

Formula for multiple correlation : 

_ ^12.34— n-l ~ ^lw.34...«-1^2w.34...«-l 

»^12.34...«-,, 2 )i(l-r^ \i' 

\^ ' In. 34. ■■«-!/ \^ ' 2n.3i--n-l) 

* For an intereBting representation of correlation between three variables by a model 
showing the distribution of points in space, see 6. Udny Yule, An Introduction to the 
Theory of Statistics, C. Griffin and Co., London, 1911, pp. 241—243. 

t W. Brown, Brit. Journ. of Psychol. 1910, in. p. 317. 
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For four variables this becomes : 
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''12.34 — 



'12.3 



'14.3'24.3 



(1 



14.3 
'■12 



:)*(! 



.2 

' 24.3 



r 



r^^r, 



13 '23 



(l-rx3^)*(l-'-23')*' 
Table I 



Correlation coefficient 


log(l-r») 


12 


0-78 


1-59284 


13 


0-45 


1-90173 


14 


0-40 


1-92428 


23 


0-48 


1-88627 


24 


0-29 


"1-96185 


34 


0-52 


1-86308 



Table II 



Correlation coefficient 
(zero order) 


Product term 
of numerator 


Numerator 


Correlation coefficient 
(first order) 


log(l-r'i) 


12 
13 
23 


0-78 
0-45 
0-48 


0-2160 
0-3744 
0-3510 


0-5640 
0-0756 
0-1290 


12-3 
13-2 
23-1 


0-7199 
0-1377 
0-2308 


1-68281 
1-99169 
1-97623 


12 
14 
24 


0-78 
0-40 
0-29 


0-1160 
0-2262 
0-3120 


0-6640 

0-1738 

-0-0220 


12-4 
14-2 
24-1 


0-7570 

0-2902 

-0-0386 


1-63038 
1-96179 
1-99348 


13 
14 
34 


0-45 
0-40 
0-52 


0-2080 
0-2340 
0-1800 


0-2420 
0-1660 
0-3400 


13-4 
14-3 
341 


0-3091 
0-2176 
0-4154 


i-95639 
1-97893 
1-91774 


23 
24 
34 


0-48 
0-29 
0-52 


0-1508 
0-2496 
0-1392 


0-3292 
0-0404 
0-3808 


23-4 
24-3 
34-2 


0-4027 
0-0539 
0-4536 


1-92316 
1-99873 
1-89996 



Table III 



Correlation coefficient 
(first order) 


Product term 
of numerator 


Numerator 


Correlation coefficient 
(second order) 


log(l-r«) 


12-4 
13-4 
23-4 


0-7570 
0-3091 
0-4027 


0-1245 
0-3048 
0-2340 


0-6325 
0-0043 
0-1687 


12-34 
13-24 
23-14 


0-727 
0-007 
0-272 


1-67345 
1-99998 
1-96662 


12-3 
14-3 
24-3 


0-7199 
0-2176 
0-0539 


0-0117 
0-0388 
0-1567 


0-7082 

0-1788 

-0-1028 


12-34 
14-23 
24-13 


0-727 

0-258 

-0-152 


1-97009 
1-98985 


13-2 
14-2 
34-2 


0-1377 
0-2902 
0-4536 


0-1316 
0-0625 
0-0400 


0-0061 
0-2277 
0-4136 


13-24 
14-23 
34-12 


0-007 
0-258 
0-436 


1-90843 


23-1 
24-1 
34-1 


0-2308 

-0-0386 

0-4154 


-0-0160 

0-0969 

-0-0089 


0-2468 

-01345 

0-4243 


23-14 
2413 
34-12 


0-272 

-0152 

0-436 


— 



10—2 
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The regression equation between changes in intelligence (as measured 
by the combination test) and changes in the other three variables is 

■^3 ~ ''31 . 24 •''1 "1" "32 . 14 •''2 "I" "34 . 12 -^f 

Calculation of regression coefficients. 

CTi = 68-19, cTg = 58-43, ctj = 16-34, a^ = 9-70, 

<^\.23...n = Ol (1 - >-i2^) (1 - As. 2) (1 - »-^4.23) •-- (1 " ^-V . 23 ...«-l)- 
••• '^3.24 = «^3 (1 - '-32^)* (1 - '•^34.2)* 

= 12-77. 
Similarly, 

<^i. 24 =40-83, as. 14 =13-27, 02.14 = 36-27, £73.12= 14-23, <^4,i2 = 8-82. 

■'H.Zi — n — '12.34 — K . 



Again, Oi2.34...n — »*i2.34...K ' 

"2.34...m 



.-. 631.24 = ^31.24^= -002. 

0^1.24 

Similarly, 632. 14 =-099, &34.i2=-703. 

Hence regression equation is 

X3 = -002x1 + "099x2 + '703x4 • 

The standard error (ff3.i24) made in estimating x^ from x^, x^ and x^ 
by this equation 

= <^3(l-%¥(l-^\2.l)*(l- '•'34.12)* 

= 12-78. 

(5) SPURIOUS CORRELATION 
Correlation is said to be spurious when it is due to extraneous con- 
ditions and does not arise directly out of the functions under con- 
sideration. The term is one of relative and not of absolute significance, 
but its appropriateness will become apparent after a consideration of 
the following two examples : 

1. Heterogeneity of material. 

Let us suppose that two distinct groups of children have been 
measured for two characters A and B, and that the mean abiHties, in 
both A and B, are higher in the one group than in the other. Then, 
even if there is no correlation between the two characters, as estimated 
from each group separately, a positive correlation will be obtained from 
the two groups taken together. On the other hand, if the mean abihty 
in A is higher, and the mean abihty in B lower, in the one group than 
in the other, a negative correlation will be obtained by taking the two 
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groups together. The correlation in each case will be due simply to the 
heterogeneity of the material employed. The difference in the mean 
values for the two groups must of course have some cause, such as a 
difference of nationahty, sex, or even locality (within any one town or 
district) from which the children are drawn; or, again, such a cause as 
the difference of discipUne to which the two groups have been subjected 
in the past. These are extraneous conditions and, if measurable, can 
be allowed for by employing the method of partial correlation. As a 
rule, however, they are not easy to determine quantitatively; hence 
their dangerous character. 

2. Index Correlation*. 

This is a form of spurious correlation which arises from the use of 
ratios for measurements. Thus if 



^1= ^' ^2 = ^ 



■2 



■3 "^Z 



and x-^, x^, x^ are uncorrelated with one another, it can be shown that 

s 



the correlation between z-^ and z^, is 



'Jkh^^M'-m" 



a quantity which may be as large as -5. 

As an illustration from psychology, we may mention the correlation 
of errors of observation, either of different individuals or of the same 
individual at different times. Even if the absolute errors are entirely 
uncorrelated, the percentage errors (errors expressed as percentages or 
other fractions of the observed quantity) will often show a large amount 
of correlation. In such a case as this, the important question arises 
whether we are, from other considerations, justified in using ratios (or 
indices) rather than absolute measures. If we are, the index correlation 
is not to be regarded as spuriousf . 

* Karl Pearson, "On a form of Spurious Correlation which may arise when Indices 
are used in the Measurement of Organs," Proc. Boy. Soc. 1897, LX. p. 489. In a note 
following Pearson's paper Sir Francis Galton illustrates the occurrence of index correlation 
by a simple and illuminating example. 

t Spurious correlation of this (as of any) kind can be eliminated by partial corre- 
lation. 
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(6) VARIATE DIFFERENCE CORRELATION, OR THE ELIMINATION OP 
SPURIOUS CORRELATION DUE TO POSITION IN SPACE AND TIME* 

This is a method for determining the correlation of variations from 
the "instantaneous mean," by correlating corresponding differences 
between successive values. If two variates^a; and y are such that 

x = <f>{t) + X, 
y=f(t)+Y, 

where X and Y are the parts of x and y independent of the time t, then 
it can be shown that on certain not unreasonable assumptions, 

if m is large enough, where „Dj. is the mth difference between successive 
values of x, and similarly „!)„. For example : 

2 A 2^a> %I>x t^x 

47 

6 
53 -4 

2 4 

55 -5 

2 -1 

57 -1 

1 
58 

Clearly the number of corresponding cases to be correlated is reduced 

each time. The method is only valid, inter alia, when there are so many 

cases in the x column that this reduction is immaterial. The following 

numerical example will it is hoped serve to make clear any obscure 

points in the above necessarily short account. The correlations between 

differences are worked out until r remains steady for several successive 

differences. 

Numerical illustration of the Variate Difference Correlation Method'f 

The numbers headed Savings and Tobacco respectively are "indices" 
from Professor Georgio Mortara's article in the Giornale degli Economiste 
e Rivista di Statistica, February 1914. As they stand, they include a 
continuous secular- increase both of savings and of consumption of 
tobacco which has occurred in Italy during the period in question, and 
their correlation is -984:. But first differences only correlate to an extent 
•766 as is shown in detail in the following working: 

* Miss F. E. Cave, Proc. Boy. Soc. 1904, lxxiv. p. 407; R. H. Hooker, Joum.. Soy. 
Stat. Soc. 1905, Lxvin. p. 396; "Student," Biometriha, 1914 — 15, x. p. 179; and Anderson, 
Biomeirika, 1914 — 15, x. p. 269, where probable errors are given. 

■f From Beatrice M. Cave and Karl Pearson, Biometrika, 1914 — 15, X. p. 340. 
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Year 


Savingg 

X 


A 


From 
5 


Sq. 


Product 


Sq. 


From 
3 


i^„ 


Tobacco 

y 


1885 


47 
















82 






6 


1 


1 


1 


1 


1 


4 




1886 


53 
















86 






2 


-3 


9 


3 


1 


-1 


2 




1887 


55 
















88 






2 


-3 


9 


15 


25 


-5 


-2 




1888 


57 
















86 






1 


-4 


16 


12 


9 


-3 







1889 


58 
















86 






1 


-4 


16 


8 


4 


-2 


1 




1890 


59 
















87 






1 


-4 


16 


8 


4 


-2 


1 




1891 


60 
















88 






4 


-1 


1 


2 


4 


-2 


1 




1892 


64 
















89 






1 


-4 


16 


8 


4 


-2 


1 




1893 


65 
















90 









-5 


25 


20 


16 


-4 


-1 




1894 


65 
















89 






3 


-2 


4 


8 


16 


-4 


-1 




1895 


68 
















88 






2 


-3 


9 


9 


9 


-3 







1896 


70 
















88 






3 


-2 


4 


8 


16 


-4 


-1 




1897 


73 
















87 






2 


-3 


9 


3 


1 


-1 


2 




1898 


75 
















89 






4 


-1 


1 


1 


1 


-1 


2 




1899 


79 
















91 






4 


-1 


1 


2 


4 


-2 


1 




1900 


83 
















92 






4 


-1 


1 











3 




1901 


87 
















95 






5 











1 


-1 


2 




1902 


92 
















97 






7 


2 


4 


-2 


1 


-1 


2 




1903 


99 
















99 






8 


3 


9 











3 




1904 


107 
















102 






8 


3 


9 


3 


1 


1 


4 




1905 


115 
















106 






12 


7 


49 











3 




1906 


127 
















109 






17 


12 


144 


36 


9 


3 


6 




1907 


144 
















115 






11 


6 


36 


36 


36 


6 


9 




1908 


155 
















124 






13 


8 


64 


32 


16 


4 


7 




1909 


168 
















131 






12 


7 


49 


28 


16 


4 


7 




1910 


180 
















138 






7 


2 


4 


6 


9 


3 


6 




1911 


187 
















144 






5 











16 


4 


7 




1912 


192 
















151 






145 




506 


247 


220 




69 
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In the table the variate x is the Savings Index, y is the Tobacco 
Index. We then have 

Mean of J)^ = 145/27 = 5-37, 
MeanofiZ)^= 69/27 = 2-56. 

Take 5 and 3 as provisional centres, so that c^^ = -37, rfj = ~ "44. 
From deviations from these points we get, as shown in the table, 

S (iD/) = 506, 

S(iZ),A) = 247, 

8 (iZ)/) = 220. 

Therefore the correlation (correcting for d,^ and d,^ is 

,= ?^I±^ = .77. 

-v/(506 - 4) v'(220 - 5) 

Second differences have a small negative correlation, which increases 
till with sixth differences we reach — -431, which seems to indicate that, 
when time has been ehminated, expenditure on tobacco in any year 
means less money saved. 



CHAPTER VIII 

THE CORRECTION OF RAW CORRELATION COEFFICIENTS 

Historical account — The eUmination of irrelevant factors — Reliability coefficients — 
Correction for observational errors. 

(1) HISTORICAL ACCOUNT 

The history of the use of the theory of correlation in Psychology can 
hardly be said to have begun earher than the commencement of the 
present century. During the previous twenty years, indeed, a great deal 
of work had been done by many observers in measuring simple mental 
abilities (by the "mental test" method) in larger or smaller groups of 
subjects, and attempts had even been made to determine in what way 
these abilities were related to one another and to more general mental 
ability, or " general intelligence." 

Owing, however, to a universal lack of knowledge of the mathe- 
matical theory of correlation among psychologists during this period, 
the results were not obtained in a form suitable for comparison with 
one another, so that it is not surprising to find that they hopelessly 
contradict one another. The heterogeneity of the material worked 
with, the non-elimination of irrelevant factors, and the absence of any 
measure of the "probable error" of the results make the conclusions 
drawn by the investigators themselves from their researches utterly 
unreliable. 

The first investigation showing any mathematical precision was that 
pubUshed by Clark Wissler* in 1901. It contained (inter alia) an account 
of the careful application of a large number of simple mental tests upon 
over 200 college students, and a correlation of the results with one 
another and with the students' marks in the various subjects of the 
college curriculum. The mental tests were found to correlate but slightly 
with one another or with abiUty in college subjects of study, though 
these latter showed considerable correlation Avith one another (-SO — 75). 

In the following year Aikens and Thorndikef published results which 

* Clark Wissler, "The Correlation of Mental and Physical Tests," Psychological 
Review, Monograph Supplement, m. No. 16, June 1901. 

j " Correlations among Perceptive and Associative Processes," Psychological Review, ix. 
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were in a sense confirmatory of those of Wissler, since, notwithstanding 
the greater similarity to one another of the functions investigated than 
in Wissler's research, the correlations were again found to be low. For 
example, different tests devised for the measurement of "speed of 
association " were found to show hardly any correlation — a result which 
seemed to furnish some justification for the author's statement that 
" quickness of association as an abifity determining the speed of all one's 
associations is a myth" {op. cit. p. 375). A similar lack of close relation- 
ship was found in the case of other mental functions which would, on 
the evidence of introspection alone, be confidently classed as particular 
instances of the same general mental function. 

In 1904 there appeared an epoch-making article by Professor C. 
Spearman*, the ideas originating in which have, at least in England, 
ever since dominated correlational work in its apphcations to psychology. 
Since we shall have frequent occasion in the course of this and the 
succeeding chapter to take exception to Professor Spearman's theories 
and mathematical methods, which appear to us incorrect and harmful, 
we may perhaps be allowed at this point, before embarking on contro- 
versial matters, to express our opinion that only Professor Spearman's 
enthusiasm and originality could have given to psychological correlation 
research the life and activity which it has shown during the last fifteen 
years. His work has stirred up both disciples and opponents to investi- 
gations which would otherwise never have occurred to them. 

The new ideas in question fall into two main groups. 

(1) Corrections to the raw values of correlation coefficients. Instead of 
measuring large numbers of individuals, as his predecessors had done. 
Professor Spearman contented himself with small numbers, groups of 
less than 40 in his first research, and as few as 11 in the second; but he 
proposes to make up for the unreliabiUty thus introduced into the 
results by a more careful measurement of his cases, and the application 
of "corrections" to the "raw" values of his correlation coefficients by 
means of appropriate mathematical formulae. 

(2) The discovery of " hierarchical " order among correlation coefficients, 
and the Theory of General Ability, or the Theory of Two Factors, which 
has been built up on this foundation. This theory has been a great incentive 
to research, and may possibly correspond to the facts, though we do 
not incline to think so. But its deduction from the occurrence of 
"hierarchical" order among the correlation coefficients is invalid, as 

*i "General Intelligence objectively determined and measured," Amer. Journ. of 
Psychol. XV. pp. 201—292. 
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will be shown in the next chapter. In the present chapter we turn to 
the closer consideration of the first group of ideas, the correction of raw 
correlation coefficients. 



(2) THE ELIMINATION OP IRRELEVANT FACTORS 

The first kind of correction is that for the elimination of irrelevant 
factors, and is nothing new in the theory of correlation, being simply 
the method of partial correlation described in the last chapter. 

For example, to eUminate the effects of difference of age in the group 
experimented upon, one would determine the partial correlation between 
the two characters under consideration, for "age constant," by means 
of the Yulean farmula given on p. 137. A similar procedure is needed 
for ehminating the effects 6f difference of sex, etc. A preferable course, 
however, would be to dispense as far as possible with the necessity for 
such corrections by selecting groups of individuals of the same age, 
sex, etc. Indeed, as Professor Spearman very properly points out*, 
the partial correlation formula must on no account be used in cases 
where there is too violent heterogeneity of the irrelevant factor. For 
example, we might with justice use it to eliminate the effects of age in 
a group where the extreme differences of age were only over a range of 
two or three years, but not in a group where the subjects ranged from 
say five years to fifteen years of age. 

(3) RELIABILITY COEFFICIENTS 

The second kind of correction introduced by Professor Spearman is 
a correction for what he calls "accidental" errors. This correction is 
based on the "reliabiUty coefficients" introduced by Professor Spearman, 
and as these rehabihty coefficients are in themselves a very useful con- 
ception quite apart from their employment in this connection, we take 
this opportunity of explaining them. 

Anyone who has carried out psychological experiments, or even an 
ordinary examination, on the same subjects with similar tests on two 
or more different occasions does not need to be reminded that the results 
will differ, sometimes very decidedly. The order of merit of the children 
in a test to-day will not be the same as their order of merit in a closely 
similar test to-morrow. Unless however the differences are only slight, 
it is clear that the test or examination is of no practical use. Its re- 

* "Demonstration of Formulae for the True Measurement of Correlation," Amer. 
J (mm. of Psychol. 1907, xvin. especially p. 166. 



156 COERELATION [pt. ii 

liability can be conveniently measured by the correlation coeflacient of 
the marks obtained on the two different occasions. Such a correlation 
coefficient is called a reUabihty coefficient. In practice tests which give 
rehabihty coefficients lower than 0-6 are useless, and the ideal would 
be a great deal higher than this. 



(4) CORRECTION FOR OBSERVATIONAL ERRORS 

It is by the aid of these rehability coefficients, as we have said, that 
Professor Spearman carries out the calculations which have for their 
object the elimination of observational errors. 

It is clear that if the correlation between two series of quantities is 
really perfect, then any observational errors made in jmeasuring these 
quantities can only, and will actually, reduce the correlation. The 
amount of this reduction will probably be greater, the greater are the 
observational errors. The size of these is however indicated to some 
extent by the rehability coefficients, so that it is but a short step to use 
these to correct for the reduction, and enlarge the correlation coefficient 
to its true value. An example given by Professor Spearman himself* 
shows this so clearly that it is not out of place to repeat it here. 

"A target was constructed of a great many horizontal bands, 
numbered from top to bottom. Then a man shot successively at a 
particular series of numbers in a particular order. Clearly, the better 
the shot, the less numerical difference between any number hit and that 
aimed at; now, just as the measurement of any object is quite appro- 
priately termed a ' shot ' at its real value, so, conversely, we may perfectly 
well consider the series of numbers actually hit in the hght of a series 
of measurements of the numbers aimed at. When the same man again 
fired at the same series, he thereby obtained a new and independent 
series of measurements of the same objects. Next, a woman had the 
same number of shots at some set numbers in a similar manner. If, 
then, our above reasoning and formulas (see below) are correct, it should 
be possible, by observing the numbers hit and working out their corre- 
lations, to ascertain the exact resemblance between the series aimed at 
by the man and the woman respectively. In actual fact, the series of 
numbers hit by the man turned out to correlate with those hit by the 
woman to the extent of 0-52; but it was noticed that the man's sets 
correlated with one another to 0-74, and the woman's sets with one 
another to 0'36; hence the true correspondence between the set aimed 

* Amer. Jowrn. of Psychol. 1904, xv. p. 271. 
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at by the man and tliat aimed at by the woman was not the raw 0-52, but 

0-52 
^ ^ V(0-74 X 0-36) ^ ^'^^' 
that is to say, the two persons had fired at exactly the same series of 
bands, which was really the case." 

It will be seen that the formula for correction is "divide the raw 
correlation by the geometrical mean of the two rehabihty coefficients." 
This formula, or rather the more accurate formula of which it is an 
approximate form, we shall prove presently. Meanwhile some general 
remarks on it may be made. Clearly, the formula will always increase 
the raw correlation, supposing, as is in practice usually the case, that 
the correlations concerned are all positive. As we have seen, it can at 
once be admitted that a perfect correlation will of course be reduced by 
observational errors, so that the raw correlation wiU need an increase. 
Similarly, it may on common-sense grounds be allowed that even when 
the true correlation is not perfect but only very high, the same will be 
true, but with less and less certainty as the correlation is really less and 
less perfect. 

In the example given, the true correlation was really perfect, and 
therefore it is clear that any upward correction will be an improvement, 
provided that it does not go past unity. But suppose we take a case where 
the true correlation is not unity, but some less amount. The following 
is an actual trial. Again two persons fired at a target, a man and a 
woman. The correlation between the series of numbers aimed at by 
the man and that aimed at by the woman, was 0'74. The measured 
correlation however between the series really struck by the man and that 
really struck by the woman was 0-65. The rehabihty coefficients were, 
for the man, 0-69 and for the woman, 0-79. The corrected value of the 
measured correlation is therefore 

0-65 ._ o.oq 

'-V(0-69x0-79)-^^^' 

which overshoots the mark, being 0'14 greater than the true value, 
whereas the uncorrected value was only 0*09 too small. 

The fact is, that the assumptions underlying Professor Spearman's for- 
mula are never fulfilled in practice. What these assumptions are can best 
be seen from the elegant proof given for the formula by Mr Udny Yule*. 

* See Appendix (e) to Professor Spearman's article, Brit. Journ. of Psychol. 1910, rn. 
p. 294. There is a printer's mistake of omission in the last formula. See also W. Brown, 
"Some Experimental Results in Correlation," Comptes Bendus du F/"" Congr^ Inter- 
national de Psychologie, Geneve, 1910, where the same proof is quoted. 
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Xi and y^^ are measures of x and y at a certain series of measurements, 
ajg and 2/2 are measures of x and y at another series of measurements. 

Let Xi = X + Si, X2 = X + §2, 

2/1 = y + «!. yi = y + ^2. 
all terms denoting deviations from means. 

Then, if it is assumed that S, e, the errors of measurement, are 
uncorrelated with one another or with x or y, 

S {xS) etc. = 0, 
S (xit/i) = S (xy). 

Hence ^xivi^^xi^^vi = I'xv'^x'^v^ 

and similarly r^, ^^ a^ a^, = r^y a^ Oy, 

'^X2yi^X2^yi ^^ "^xy^x^yf 

But also, since S (058) = 0, S {x-^^ = S (x^), 
and 'r^^^c7^^a^ = cr^^ 

or °x^o^^=- — and Oy^ay^ = -— , 



or 



Geom. Mean of correlation coefl&cients 



°™ Geom. Mean of reliability coefficients 
It is this formula which is employed in the example given above of 
shooting at a target, the numerator there however consisting only of 
»'(a:i+x2)(j/i+»2)> *t® subcalculations for r^^y^ etc. not being made. 

Attention must be drawn to the assumption that the errors of 
measurement 8 and e are uncorrelated with each other or with x or y. 

Now, these are very large assumptions to make. Even in cases where 
the quantities 8, e are genuine errors of measurement, there are strong 
reasons for assuming (on general principles and also from experimental 
evidence)* that they will be correlated. But in the case of almost all 
the simpler mental tests the quantities 8 and e are not errors of measure- 
ment at all. They are the deviations of the particular performances from 
the hypothetical average performance of the several individuals under 
consideration. Thus they represent the variability of performance of 

* See Karl Pearson, "On the Mathematical Theory of Errors of Judgment, with 
special reference to the Personal Equation," PMl. Trans, oxovui. A, pp. 235 — 299. 
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function within the individual. When an individual in the course of 
three minutes succeeds in striking through 100 e's and r's in a page of 
print on one day, and 94 under the same conditions a fortnight later, 
there is no error of observation involved. The numbers 100 and 94 are 
the actual true measures of abihty on the two occasions. The average 
or mean abiUty, which is the more interesting measure for the purposes 
of correlation, is doubtless different from either, but that does not make 
the other two measures erroneous. Evidently in these cases 8 and e 
represent individual variability, and to assume them uncorrelated with 
one another or with the mean values of the functions is to indulge in 
somewhat a priori reasoning. 

There are two comparatively simple ways of testing the assumption : 

(1) S (Xjyj) = S {xy) = S (x^y^), 

.-. S (x-^yi) — S (x^y^) should = within the limits of the probable 
error of the difference. 

Brown has apphed this test to the case of correlation between 
accuracy in bisecting hues and accuracy in trisecting them in 43 adult 
subjects. 

Here S (bjtj) - S {hj,^) = 137780 - 60036 

= 77744, 

p.E. of S {xy) = -67449 J'PmZLl^S^^ in Pearson's notation, 

•67449 1 8 {xyf S (x^) S (y^) 

~ y/n V w n^ ' 

p.E. of ,S (&i<i) = 687 ; p.E. of 8 {\t^) = 365 ; 



.-. p.E. of S (b^tj) - S (b^t^) = -\/6872 + 365^ 

= 778. 

Since 778 is less than one-third of 77744, the formula cannot be 

employed to obtain the correlation between mean abihties in bisecting 

and trisecting lines. 

S {{Xi - x^) (yi - y^)} 

^^' 'fi-¥rvs(x,-x,r.siy,-y,r 

= 0, 
if errors are uncorrelated with one another (since numerator then = 0). 
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Applying this test to the same case of bisection and trisection, 
Brown gets 

rB,-B, = 0-30 ± 0-09, 

which proves once more the inapplicabiUty of the formula. 

Brown appUed test (2) also to the case of correlation between speed 
of addition of figures and accuracy of addition in a group of 38 school- 
children (girls between the ages of 11 and 12) and found 

rs,-s, = 0-35 ± 0-09. 

Even when test (2) does give the value 0, we can only conclude from 
this that 

S (Siei) + S (8,^2) = -S (S^e,) + -S (8,€i). 

Replying to these and other criticisms of his formula for ehminating 
observational errors, Professor Spearman* admits that many forms of 
error will be correlated with each other and with the true values of the 
quantities measured. But these, he says, are generally of a continuously 
progressive nature, and he proposes to ehminate their influence by 
making at least three measurements of x, and taking the first and third 
of these together as the x-^ of the formula, the middle one as x^; or to 
use more comphcated but essentially similar devices. No doubt, of 
course, this is a wise precaution to take, even though sceptics may still 
doubt whether the remaining so-called "accidental" variations are even 
yet uncorrelated with each other and with x and y. 

Brownf, using experimental data obtained by two independent 
observers estimating the lengths of fines, found a considerable corre- 
lation ratio between the errors of observation and the true lengths of 
the fines, the regression being very far from finear. He is of opinion 
that "no assumptions as to the correlation or non-correlation of such 
deviations are in the least justified." 

The correlation table between lengths and errors, and the corre- 
sponding regression curves, are shown in the following table and dia- 
gramj. 

In a review of Brown's article Mr J. R. Wilton§ has made some 
ingenious suggestions for weighting the different cc's in a manner sug- 
gested by quadrature formulae. He remarks also that a grouping 
should be sought which would as far as possible satisfy the assumptions. 

* Brit. Journ. Paychol. 1910, m. p. 271. f Brit. Joum. Psychol. 1913, vi. p. 223. 

t From Brit. Joum. Psychol. 1913, vi. pp. 236—8. 
§ Journ. of Exp. Pedag. 1914, n. p. 302. 
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It may finally be pointed out that even if the errors 8 and e were 
known with certainty to be uncorrelated with each other and with the true 
values, yet, with such small numbers of cases as are used in many of the 
psychological researches in which Professor Spearnmn's formula has been 
employed, the chance of, e.g. S (x^y^) being nearly equal to S [x^^ is 
exceedingly small, and it is difficult to attach any meaning to an artificial, 
post hoc separation of the data into halves such that this condition is 
satisfied (and also others). The formula is at any rate inapplicable to 
samples such as 30, 24, 52, which have been freely used in experimental 
work ; or if used at all, it can only be as a guide to the sufficiency of the 
sample. It is an essential of good work to use such samples that correc- 
tions to the raw values obtained are unimportant. 

Regression Curves of Correlation Table* 
>■ Errors made, in mm. 
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CHAPTER IX 

THE THEORY OF GENERAL ABILITY 

TMs apparent unity is illusory. Man, in fact, is a microcosm as complex as the 
world which is mirrored in his mind; he is a federation incompletely centralised, a 
hierarchy of numerous and conflicting passions, each of which has ends of its oiim, and 
each of which, separately considered, would give a different law of conduct. He is in 
some sense a unit, hut his unity is such as to include an indefinite number of partly 
independent sensibilities. 

Leslie Stephen, The Science of Ethics, p. 69. 

Discovery of "Hierarchical" order among correlation coefficients — Use of the 
formula for the correction of observational errors to prove the existence of a 
general factor — Researches between 1904 and 1912 — A criterion for hierarchical 
order applied to numeroiis researches — Complications in the original theory. 

(1) DISCOVERY OP HIERARCHICAL ORDER AMONG CORRELATION 

COEEPICIENTS* 

The controversy as to whether abihty in any individual is general, or 
specific, or in groups or " faculties " is a very old one, but for the purposes 
of the present chapter it is not necessary to go back prior to 1904, in 
which year there was pubhshed the first f of a series of articles in which 
Professor C. Spearman has developed his Theory of General Ability, or 
Theory of Two Factors, as it is alternatively named. 

Professor Spearman's method in that paper was to measure a number 
of mental abilities, some of them school subjects, others artificial tests, 
in a number of persons, and calculate the correlation coefficients of each 
of these activities with each of the others. These correlation coefficients, 
he then noticed, had a certain relationship among themselves, a relation- 
ship which may be called hierarchical order, and is explained in detail 
later. He saw, quite rightly, that the presence of a general factor would 
produce this hierarchical order among the coefficients, and, reversing 
this argument, he concluded that the presence of hierarchical order 
proved the existence of a general factor. 

In this first series of investigations Professor Spearman used the 
following groups of subjects: 24 village school-children of both sexes, 

* Sections 1, 3 and 5 of this chapter are largely extracts from an article by G. H. 
Thomson in the Psychological Review, 1920, xxvn. p. 173. 

t "General Intelligence objectively determined and measured," C. Spearman, Amer. 
Journ. Psychol. 1904, XV. pp. 201—293. 
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age limits 10-0 to 13-10; 23 boys of a high class preparatory school, 
age limits 9-5 — 13-7; and 27 adults of both sexes, age hmits 21 — 78. 
The tests employed were those for pitch discrimination, weight dis- 
crimination, and discrimination of Ught intensities; and measures of 
intelUgence were obtained, in the case of the children, from results of 
school examinations, grading by teachers, and grading of one another 
by the children themselves (measure of common sense). 

The various school subjects in the preparatory school were found to 
correlate highly with one another, and when, with the inclusion of pitch 
discrimination and music, they were arranged in rows and columns, it 
was found possible to place them in such an order that the correlation 
coefficients formed a hierarchy, each being (with very few exceptions) 
greater than any to the right of it in the same row, or below it in the 
same column, thus 





Classics 


French 


English 


Math. 


Discrim. 


Music 


Classics .. 


— 


0-83 


0-78 


0-70 


0-66 


0-63 


French .. 


0-83 


— 


0-67 


0-67 


0-65 


0-57 


English .. 


0-78 


0-67 


— 


0-64 


0-54 


0-51 


Math. .. 


0-70 


0'67 


0-64 


— 


0-45 


0-51 


Discrim... 


0-66 


0-65 


0-54 


0-45 


— 


0-40 


Music 


0-63 


0-57 


0-51 


0-51 


0-40 


— 



This fact of "hierarchical" order which he had thus discovered was 
taken by Professor Spearman to indicate the presence of some common 
fundamental function which saturates in different degrees the different 
activities, and is the sole cause of correlation between them except in 
the case of very similar activities. 

It can easily be shown that if all the correlations are due solely to 
one common or general factor, then the correlation coefficients will be 
in perfect hierarchical order*. 

Let a and p be two mental tests or other activities, and g be the 
general factor. Then r^^.g is the correlation that a would have with p 
for constant g, and equals 



V(i-^/)V(i-0' 

But if g is the sole source of correlation, r^^^g must be zero, i.e. 

Similarly r^j, = r^grgg. 

Hence 'k = !!«» and similarly = ^. 

* Hart and Spearman, Brit. Jawrn. Psychol. 1912, v. p. 58 quoting Yule. Previous 
though less satisfactory proofs had also been given by Spearman. 
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A little consideration of this last equation shows that, if it be true 
for any four of the tests, it imphes the possibiUty of arranging the corre- 
lation coefficients in the order we have termed "hierarchical" : and more 
than this, that the values of r in any one column of the "hierarchy" 
will bear a constant ratio, each to each, to their partners in any other 
column. y 

Since clearly 'perfect hierarchical order cannot be expected in any 
experimental research, it becomes important to know what deviation 
from perfection can be allowed without giving up the idea of a general 
factor : or on the other hand, what approach to perfection can be attained 
without the presence of a general factor. These questions will occupy 
us presently. Meanwhile we turn for a while to another form of argument 
used by Professor Spearman. 



(2) USE OF THE FORMULA FOR THE CORRECTION OF OBSERVATIONAL 
ERRORS TO PROVE THE EXISTENCE OF A GENERAL FACTOR 

He considered that by the use of his formula for the correction of 
observational errors he could demonstrate the same thing (the existence 
of a "central function"), and could in particular show "that the common 
and essential element in the intelhgences wholly coincides with the 
common and essential element in the sensory functions*." The method 
of proof is as follows : 

Let x^, x^ be two distinct measures of sensory discrimination, and 
y^, 2/2 two distinct measures of intelHgence. 

Then the correlation of the function common to the functions 
measured by x^, x^ with the function common to the functions measured 
by y-^, y^ is equal to 

"^ '^xivi ^(tiva ^iCaKi '^XjUj 

and if the two functions referred to are identical this expression should 
be equal to unity. 

In the present article, Spearman uses a simphfied formula, 



•^ ''iciica ''»ii/2 



and puts for the numerator the average of the various correlations 
* Amer. Journ. Psychol. 1904, xv. p. 269. 
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evaluated between the intelligences and the discriminations, and in the 
denominator puts 

'>'xiX2 = tli6 average correlation of the intellective gradings with one 
another, 

'''viVi — tli6 average correlation of the gradings in discrimination with 
one another, 
and in this way gets results approximately equal to 1 in the different 
groups tested. 

One or two remarks may appropriately be made here. In the first 
place, the full formula is the only one that can be used with any meaning 
or justice since it is the only one which issues logically from the mathe- 
matical proof. In the second place, the applicability of the true formula 
must be considered in the Ught of its presuppositions (mentioned above, 
p. 158). 

Indeed, the assumption that 8 and e are uncorrelated with each other 
or with X ox y seems even more unwarrantable here than in the case of 
"correcting" coefficients, for which the formula was originally devised. 

(3) RESEABCHES BETWEEN 1904 AND 1912 
A number of experimental researches on these Unes, in some of 
which Professor Spearman himself took part, were carried out during 
the eight years following 1904, but with very conflicting results, some 
experimenters finding the hierarchical order among the coefficients, 
others finding no such order. Two articles of this period, for example, 
are those of Mr Cyril Burt*, who found practically perfect hierarchical 
order, and Dr William Brownf , who found small trace of such order. 
A similar conflict of opinion was found with regard to the alternative 
method of attack, as for example in the research by Messrs Thorndike, 
Lay and DeanJ. The subjects examined were 37 young women students 
and 25 high school boys. The tests for sensory discrimination were: 

(1) drawing hues equal to given fines, and 

(2) filfing boxes with shot to equal in weight standard weights; 
those for intelfigence were : 

(3) judgment of fellow-students, and 

(4) judgment of teachers. 

* Cyril Burt, "Experimental Tests of General Intelligence," Brit. J<mm. Psychol. 
1909, m. pp. 94r— 177. 

t William Brown, "Some Experimental Results in the Correlation of Mental Abilities," 
Brit. Joum. Psychol. 1910, m. pp. 296—322. 

X Thorndike, Lay and Dean, "The Relation of Accuracy in Sensory Discrimination 
to General Intelligence," Amer. Jonrn. Psychol. July 1909, xx. pp. 364 — 369. 
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For the high school boys (3) and (4) were combined teachers' and 
fellow-students' judgments and school marks, respectively. 

In the first case, Spearman's formula gave for the correlation of the 
factor common to (1) and (2) with that common to (3) and (4) the valne 
0-26 instead of 1-00. In the second case, the value was 0-29. Moreover, 
Thorndike found a much higher correlation between discrimination of 
lengths and discrimination of weights than between either one of them 
and general intelligence, the coefficients being 

Accuracy in drawing fines, intelhgence 0-15, 

Accuracy in making up weights, intelUgence ... 0-25, 
Accuracy in drawing fines and making up weights 0-50. 

Thus the results were in decided conflict with both parts of Spearman's 
concluding statement "that all branches of intellectual activity have 
in common one fundamental function (or group of functions) whereas 
the remaining or specific elements of the activity seem in every case 
to be wholly different from that in all the others*." 

Thorndike sums up as follows: "In general there is evidence of a 
complex set of bonds between the psychological equivalents of both 
what we call the formal side of thought and what we call its content, so 
that one is almost tempted to replace Spearman's statement by the 
equally extravagant one that there is nothing whatever common to all 
mental functions, or to any part of themf." 

Things were in this very unsatisfactory state when an important 
article by Professor Spearman, in cooperation with Dr Bernard Hart, 
appeared in 1912f . In this article the difficulty of making an unbiassed 
judgment as to the presence or absence of hierarchical order was recog- 
nised, and a form of calculation was given for obtaining a numerical 
criterion of the degree of perfection of hierarchical order, which criterion 
would be independent of any bias on the part of the calculator and would, 
it was hoped, give the true amount of hierarchical order, corrected for 
the sampfing errors of experiment. This criterion ranges theoretically 
from zero, for absence of hierarchical order, to unity, for perfection 
of hierarchical order. But their formula can, arithmetically, exceed 
unity. 

* C. Spearman, Amer. Journ. Psychol, xv. p. 284. 
t Op. cit. p. 368. 

X "General Ability, ita Existence and Nature," by B. Hart and C. Spearman, Brit. 
Journ. Psychol. 1912, v. pp. 51 — 84. 
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(4) A CRITERION FOR HIERARCHICAL ORDER 

The underlying idea was that if the above square table * of correlation 
coefficients shows hierarchical order in any degree, there will be corre- 
lation between the columns of that table taken in pairs, and that when 
the hierarchical order is perfect the columnar correlation R will rise to 
unity, except in so far as it is blurred by the sampUng errors, which 
obviously cannot increase an already perfect correlation, but can only 
decrease it. Let us write dashed letters throughout for the true values 
of the various quantities, which in ordinary experiment are unknown, 
reserving undashed letters for their measured values. We then have: 

r' = true correlation coefficient, 
e = its sampHng error on one occasion, so that 
r = r' + e, 

r' = mean of the column of true values r', 
r = mean of the column of observed values r. 

In finding these means, that coefficient is omitted which has no partner 
in the column with which correlation is being found. Write also 

p' = r' measured from the mean of the true column, i.e. 

= r' — r', and similarly 
p = r measured from the mean of the observed column, i.e. 

= r-r, 
e = p - p', = e - e, 

where e is the mean of the column of e's. 

Then for two columns a and 6, the true columnar correlation which 
we desire to know is 

p ' _ " (p-ca p^ ) ,,, 

"* V{S{pJ^)S{p^'^)} ^ ^' 

by the Bravais-Pearson product-moment formula, S indicating summa- 
tion over the various values of x, i.e. summation up the column. This 
can be written 

p ' ^ iPxaPaSb) — S (eiroeaa ) — ^ (Picb' ^xa) — S (pxg' eg,) 

Hah = - — — — 



V{S{pxaPxa) - S{e^€^) - ^S{pj€^)}\/{S(p^p^) - S{e^ea,) - 2S{p-a,'€^)}' 

In this expression, the three quantities of the form S (pp) are known. 

The three quantities of the form S (ee) are not known, but an attempt 

can be made to estimate their probable values from the known standard 

deviations of the correlation coefficients. The four quantities of the 

* p. 165. 
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form S (p'e) are treated by Dr Hart and Professor Spearman, in their 
paper, as negligible, on the ground that p' will not in general be correlated 
with €. This assumption, as we shall presently see, was erroneous. 

The formula at which Dr Hart and Professor Spearman eventually 
arrive, after neglecting these quantities and making various other 
assumptions, is 

"' V{S (Pxa^) - (n - 1) o^} V{S (pj") -in-l) ^^} '' 

where the a's are standard deviations of the correlation coefficients, the 
bar indicates mean values for the column, and n is the number of 
pairs of correlation coefficients concerned, in the two columns. In using 
their formula, its authors do not apply it to all the pairs of columns 
in the square table. They say: "In any case the correction must be 
kept within hmits : as usual, the larger the correction the less it is to be 
trusted. If the samphng errors are large enough, they eventually will 
quite swamp the true differences of magnitude upon which the observed 
correlation should be based. In this case, the true correlation is beyond 
ascertainment; any attempt at correction is merely illusory. To avoid 
this, and at the same time to ensure impartial treatment of all data, 
it is necessary to fix beforehand some definite Kmit to the feasibihty of 
correction. We have here adopted the following standard: in order to 
attempt to estimate the correct correlation between columns, it is 
required that in each of these columns the mean square deviation should he 
at least double the correction to he applied to that deviation." 

That is to say, the equation (2) is not to be used unless, in each factor 
of the denominator, S (p^) is at least double its correction (n — 1) a^- 
This condition (the "correctional standard") will be found to be im- 
portant. 

The authors appUed their criterion to all the experimental work 
available, work dating from various periods, and representing the re- 
searches of 14 experimenters on 1463 men, women, boys and girls. 
From beginning to end the values of the criterion were positive and very 
high. The mean was almost complete unity. That is to say, Dr Hart 
and Professor Spearman claimed that all the data then available showed 
perfect hierarchical order among the correlation coefficients, even the 
data of workers like Dr Brown and Professor Thorndike, who had been 
unable to detect any such order. The reasons why the hierarchical order 
among the correlation coefficients was not obvious at a glance were, 
according to these authors, two. In the first place, their theory did not 
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entirely deny the presence of Group Factors of narrow range, and tests 
which were too similar were, according to them, to be pooled, before the 
hierarchical order would become apparent. Onlyin veryf ew cases however 
did they find it necessary to pool tests in the data used. In the second 
place, the obscuring of the perfect hierarchical order was, according to 
them, due to the fact that only a small sample of subjects is examined. For 
this error allowance is made in the formula for calculating their criterion. 

Dr Hart and Professor Spearman therefore considered their " Theory 
of Two Factors" proved. This theory considers abihty in any activity 
to be due to two factors. One of these is a General Factor, common to 
all performances. The other is a Specific Factor, unique to that par- 
ticular performance, or at any rate extending only over a very narrow 
range including only other very similar performances. "It is not 
asserted," they say, "that the General Factor prevails exclusively in 
the case of performances too alike, but only that when this hkeness is 
diminished, or when the resembhng performances are pooled together, 
a point is soon reached where the correlations are still of considerable 
magnitude, but now indicate no common factor except the General one." 

In the same paper Dr Hart and Professor Spearman consider, and in 
their opinion confute, two other theories, (a) the older view of Professor 
Thorndike, viz. a general independence of all correlations, and (6) Pro- 
fessor Thorndike's newer view of "levels," or the almost universal beUef 
in "types." If the former were true, their criterion would, they consider, 
show an average value of about zero : if the latter, a low minus value. 

Their argument runs as follows. 

If none but quite Specific Factors are present, the correlations will 
all be zero, and the pairs of columns will show no correlation with one 
another. If however correlations exist, but are due to Group Factors 
alone, then tests which share a Group Factor will correlate highly, but 
others will not correlate at all. Let there be three such Group Factors; 
then we shall obtain not a hierarchy but an arrangement hke this: 





^1 


Sz 


s» 


■4, 


A, 


A, 


A 


D, 


Dz 


Si 




h 


h 


I 


I 


I 








si 


i' 




h 


I 


I 


I 








si 


h 


h 


. 


I 


I 


I 








4 










h 


h 








A, 








h 




h 








A, 








h 


h 










A 








I 


I. 


l 




h 


h 


■Oa 








I 


I 


I 


h 




h 


A 


7 






I 


I 


I 


h 


h 


■ 



h=high. correlation. l=low correlation. See Brit. Journ. Psychol. 1912, v. p. 57. 
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in which the high correlations are concentrated along the diagonal. In 
this arrangement some columns will correlate positively, namely those 
in which the high correlations come opposite one another; but these 
will be in the minority and most pairs of columns will correlate negatively. 
Professor Spearman and Dr Hart conclude therefore that in the absence 
of a General Factor the average correlation between columns will be 
either zero or negative, and that only a General Factor will give a very 
high positive correlation between pairs of columns. 

In this consideration of Group Factors however it has been tacitly 
assumed that there is no overlapping of such factors. If this were so 
then indeed a hierarchy would be impossible. But it is at any rate a 
conceivable hypothesis that such overlapping should occur, that for 
example there might exist a factor common to three tests a, b, c and 
another common to c, d, e, so that c contains both factors : and on this 
hypothesis an excellent hierarchy can be obtained without any General 
Factor, and the average column correlation can even approach unity, 
as we shall show presently. 

(5) COMPLICATIONS IN THE ORIGINAL THEORY 

Many experimental researches were inspired by this paper of Dr Hart 
and Professor Spearman, of which, as a good example, may be cited 
one in 1913 by Mr Stanley Wyatt*. It is not too much to say that 
in practically all of these the apphcation of the Hart and Spearman 
criterion gave values closely approximating to unity and therefore sup- 
porting the Theory of General Abihty. But compUcations began to 
arise, of which the first of importance will be found in Dr Edward Webb's 
monograph on "Character and Intelhgence," in 1915t. Dr Webb con- 
sidered that he had found (in addition to Professor Spearman's General 
Abihty) a second general factor, which he calls "persistencs of motives." 
Other writers began to find that their data required for their explanation 
large Group Factors, of wider range than those contemplated in the 
original form of Professor Spearman's theory|. Quite recently Mr J. C. 
Maxwell Garnett, discussing the data of a number of workers with the 
aid of mathematical devices which he has introduced for the purpose, 

* Stanley Wyatt, "The Quantitative Investigation of Higher Mental Processes," 
Brit. Jcmrn. Psychol. 1913, vi. pp. 109—133. 

t E. Webb, "Character and Intelligence," Brit. Journ. Psychol., Monog. Supplement, 
1915, No. 3, pp. ix and 99. 

% See especially N. Carey, "Factors in the Mental Processes of School Children," 
Brit. Journ. Psychol. 1916, vin. pp. 170—182. 
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concludes that in addition to the single general factor of Professor 
Spearman, there are two large Group Factors which are practically 
general* (one of them being indeed almost identical with Dr Webb's 
second general factor), which he calls respectively "Cleverness" and 
"Purpose," both distinct from General AbiHty. 

It is clear therefore that in any case the simple original form of 
Professor Spearman's theory is becoming complicated by additions 
which tend to modify it very considerably. Meanwhile, however, one 
of us had come definitely to the conclusion that the mathematical 
foundations upon which it was based were in fact incorrect. Before 
developing the Hne of argument which led to this, it will be well to re- 
state Professor Spearman's ease in its simplest terms in a few words. 

It is entirely based upon the observation and measurement of hierarchical 
order among correlation coefficients. It states that after allowance has been 
made for sampling errors this hierarchical order is found practically in 
perfection. And, it finally states that such a high degree of perfection can 
only be produced by a General Factor, and the absence of Group Factors, 
which would mar the perfection. 

* J. C. Maxwell Gamett, "General Ability, Cleverness, and Purpose," Brit. Journ. 
Psychol. 1919, IX. pp. 345—366. 



CHAPTEE X* 
A SAMPLING THEOEY OF ABILITY 

The case against the validity of Professor Spearman's argument — Hierarchical 
order produced by random overlap of group factors, without any general factor — 
Application of the "criterion" to these cases, apparently proving the presence of 
a general factor — The erroneous nature of the "criterion" — ^Hierarchical order the 
natural order among correlation coefficients — ^A sampling theory of ability — Transfer 
of training — Conclusions. 

(1) THE CASE AGAINST THE VALIDITY OP PROFESSOR SPEARMAN'S 

ARGUMENT 

As we have already seen in previous chapters, it is possible, by means 
of dice throws or in other ways, to make artificial experiments on 
correlation, with the immense advantage that the machinery producing 
the correlation is known, and that therefore conclusions based upon the 
correlation coefficients can be confronted with the facts. Working on 
these hnes, one of us| made, in 1914, a set of imitation "mental tests" 
(really dice throws of a compUcated kind), which were known to contain 
no General Factor. The correlations were produced by a number of 
Group Factors which were of wide range, and,unhke Professor Spearman's 
Specific or Narrow Group Factors, they were not mutually exclusive. 

These imitation mental tests, containing no General Factor, gave 
however a set of correlation coefficients in excellent hierarchical order, 
and the criterion was when calculated found to be unity, so that had 
these correlation coefficients been pubHshed as the result of experimental 
work, they would have been claimed by Professor Spearman as proving 
the presence of a General Factor. In a short reply Professor Spearman 
laid stress on the fact that this arrangement of Group Factors which 
thus produced practically perfect hierarchical order was not a random 
arrangement, that it was exceedingly improbable that this one special 
arrangement should have occurred in each of the psychological researches 
of many experimenters, so improbable indeed as to be ruled entirely 
out of court J, and that a random arrangement of Group Factors, though 

* Much of this chapter, and part of the preceding, consists of extracts from " General 
versus Group Factors in Mental Activities," by G. H. Thomson, Psychol. Review, 1920, 
xxvn. p. 173. 

t Godfrey H. Thomson, "A Hierarchy without a General Factor," Brit. Journ. 
Psychol. 1916, vra. pp. 271—281. 

i C. Spearman, "Some Comments on Mr Thomson's Paper," Brit. Journ. Psychol. 
1916, vm. p. 282. 
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it might give some hierarchical order, would not give it in the perfection 
actually found. The obvious way to find out if this is so or not is to try it, 
with artificial " mental tests " formed of dice throws. This was done in 
November and December of 1918, after an unavoidable delay of some 
years. Sets of artificial variables (analogous to the scores in mental 
tests) were made, in each of which the arrangement of Group Factors 
was decided by the chance draws of cards from a pack*- It was found 
that hierarchical order resulted, which when measured by the "criterion " 
appeared to be perfect. 



(2) HIERARCHICAL ORDER PRODUCED BY RANDOM OVERLAP OP 
GROUP FACTORS, WITHOUT ANY GENERAL FACTORf 

Write down the letters x^, x^, x^, ... as the names of the variates to 
be formed, and prepare columns to receive the numbers of group factors 
and specific factors in each variate. Determine each number by the 
draw of a card from an ordinary playing pack, returning the card and 
shuffling between each draw: the knave, queen, and king may be counted 
11, 12, and 13 respectively. The result of one such set of drawings is 
shown in this table: 



Variate 


Group factors 


Specific factors 


Total 


*i 
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10 


X2 
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0=1 
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X, 
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12 


=BlO 


11 


5 


16 



Proceed next to identify the group factors of each variate. Do this 
by using a single suit of the pack. After shuffling it well, lay out the 
top five cards to represent the five group factors in Xj^, and note them. 

* Godfrey H. Thomson, " On the Cause of Hierarchical Order among the Correlation 
Coefficients of a Number of Variates taken in Pairs," Proceedings of the Royal Society of 
London, 1919, xov. A, pp. 400—408. See also, by the same author, "The Hierarchy of 
Abilities," and "The Proof, or Disproof of the Existence of General Ability," in Brit. 
Journ. Psychol. 1919, ix. pp. 321—344. 

t This section and also section 5 consists largely of extracts from the Proc. Boy. Soc. 
1919, xov. A, pp. 400—408. 
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After replacing them and reshuffling, do the same for x^, and so on, as 
in this table: 
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The next step is to note the number of factors common to each pair of 
variates, as in this table : 
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From these, and from the total number of factors both specific and group 
in each variate, can be found the correlation which would occur between 
the variates were we to throw dice, one to each factor, and repeat the 
throwings a large number of times. The formula is 

Number of common factors 

T = : > 

Geometrical mean of totals 

This formula is appUcable not only to variates formed by the addition 
of dice, but to variates which are any function of the factors or elements, 
provided that the form of the function is the same in each variate, and 
that the standard deviation is the same for each element or factor* - 
We thus obtain the following table of theoretical correlation coeffi- 
cients : 



* G. H. Thomson, loc. cit. p. 275. It can readily be deduced from Bravaia, Mimoires 
de I'Institut de France, 1846, ix. eqn. 28. 



CH. X] 



A SAMPLING THEOEY OF ABILITY 



177 





Xl 


2^2 


x» 


Xi 


Xi 


Xe 


X, 


ars 


X, 


'Xio 


Totals 


Xi 





0-22 


0-32 


0-00 


0-44 


0'17 


0-31 


0-18 


0-46 


0-32 


2-42 


x^ 


0-22 


— 


0-39 


0-18 


0-29 


0-47 


0-35 


0-00 


0-20 


0-35 


2-45 


^3 
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0-39 
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0-48 
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0-39 
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0-12 


0-41 


0-53 


0-54 
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If we wished to obtain experimental values of these, dice would have 
to be thrown, one to each factor. The die corresponding to the group 
factor called "Ace" would have its score counted into every variate 
containing the group factor in question. The dice representing the 
specific factors would, of course, only be counted into the one variate 
in which they occur. 

The last column of the preceding table gives the total correlation of 
each variate with all the others, found' by adding the rows of the square 
table. Rearrange now the sequence of the variates in the order of 
magnitude of these totals *, and we obtain the following table : 
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Here the tendency to hierarchical order is quite noticeable. This par- 
ticular example is purposely chosen from among a number calculated, 
as being that which shows the least hierarchical tendency. Even here 
however there is clearly a general lowering of the coefficients as we 
pass either along a row or down a column. The columnar correlation 
is high for the first few columns. For the columns headed Xi^ and ajj it is 
0-97, and as far down as the columns headed x^ and Xg it is still 0'65. If 
these theoretical numbers were blurred by experimental error, they might 
well be claimed as having come from a perfect hierarchy by the criteria 
in vogue. Other hierarchies chosen at random from those formed in 
the above manner, show still more perfect hierarchical order, and some- 
* A convenient p.an, but of no theoretical significance. 
B. & T. 12 



178 



CORRELATION 



[PT. II 



times the hierarchical order is almost quite perfect, as in the example 
given in the British Journal of Psychology, 1919, ix. p. 343. 



(3) APPLICATION OF THE "CRITERION" TO THESE CASES, 
APPARENTLY PROVING THE PRESENCE OP A GENERAL FACTOR 

The values of the correlation coefficients given in the above table 
are of course the real values. To obtain experimental values of these, 
dice were thrown, one die to each Group or Specific Factor, and the 
whole repeated 20 times, analogous to experiments on 20 subjects*. 

From the dice scores the observed correlations between the variates 
can be calculated, just as the correlations between mental tests are 
calculated. Using the product-moment formula we obtain the set of 
values in this table, arranged in hierarchical order, only shghtly different 
from the true hierarchical order, except that variate x^ has changed 
its position rather violently. 
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The pairs of columns which pass the Hart and Spearman correctional 
standard give the following values: 



Columns passing 
standard 


Observed columnar 
correlation R 


True columnar 
correlation 


The Hart and Spearman 

corrected columnar 

correlation B' 


2 and 7 
6 „ 7 
2 „ 3 

2 „ 6 

3 „ 6 


073 
©■63 
0^70 
081 
0^66 


0^75 
0-89 
0^60 
0-88 
0^83 


0^76 
115 
101 
108 
1^04 


Means 


0-71 


0^79 


100 



True mean columnar correlation of the whole table and not merely / (..cq 
of the pairs of columns selected by the correctional standard ( 

* G. H. Thomson, Biometrika, 1919, xu. pp. 355 — 366, where the full details of the 
dice throws are given. Sections 3 and 4 of the present chapter consist largely of extracts 
from Biometrika, where the diagrams 28 and 29 appeared. 
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Dr Hart and Professor Spearman would therefore claim the hierarchy 
as being a sample of a perfect one. The true mean columnar correlation 
for the whole table is 0-59, the Hart and Spearman correctional standard 
selects pairs of columns whose true mean columnar correlation is 0-79, 
and the mean value of these when corrected according to their formula 
rises to unity. This example goes far towards shaking confidence in 
their criterion. 



(4) THE ERRONEOUS NATURE OP THE HART AND 
SPEARMAN CRITERION 

The inaccurate and exaggerated estimates of hierarchical order which 
are given by this "criterion" arise chiefly from two causes, (1) the 
erroneous assumption that p and e are uncorrelated (see p. 170), and 
(2) the action of the "correctional standard." We shall consider these 
in turn. 

Consider the formula for the standard deviation of a correlation 
coefficient, viz. 

- 1-^^ 

where "N is the number in the sample. It follows from this that the 
larger correlation coefficients will probably have the smaller samphng 
errors e, disregarding the sign of e for the moment. 

But these signs of the quantities e are not hkely to be indiscriminately 
positive and negative. On the contrary, they will have a tendency to 
be either all positive or all negative, if, as is the case in most of the 
columns of coefficients considered by Professor Spearman, the corre- 
lations in the square table are mainly positive. The errors in the corre- 
lation of a variate aj^ with a variate a are themselves correlated with 
the errors in the correlation of the variate a with another variate x^, 
according to the formula* 

rx,x, - 2(1- r,,/) (1 - r,,„2) 

That is, the correlation of the samphng errors of r^^^ "^^^ ^^^ samphng 
errors of r^^ depends chiefly upon r^^^. To illustrate, let us take three 

* Karl Pearson and L. N. G. Filon, "On the Probable Errors of Frequency Constanta," 
Phil. Trans, of the Boyal Soc. 1898, oxoi. A, eqn. 37. 

12—2 
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correlations from an experiment in psychology, carried out by Mr Wyatt*. 
If we let 

ajj be the mental test "Rearranged Letters," 



'Missing Digits," 
'Analogies," 



the values there found were 



r^,a = 0-63, 
»•.,«= 0-61. 



Then by the above formula the correlation of the errors of these two 
coefficients depends chiefly upon r^^^^, whose measured value is 0'63. 
Using the full formula, and employing the measured values in default 
of the true ones, the correlation between r^^a ^^^ fx^a turns out to be -4:7. 
It is therefore (to an extent indicated by this value) probable that they 
are either both too large or both too small. The same argument holds, 
in varying degrees, for the other correlations all over Mr Wyatt's table, 
which are all positive. They all have a tendency to be either all too 
large or all too small: in other words, the e's tend to be all of the same 
sign. The relationship between the correlation coefficients of a column, 
and their errors, can therefore be summed up in the following table, 
in which the symbol | e | denotes the magnitude of e regardless of sign. 



r' 


|e| 


P' 


e or e 


p'e or p'e 


large 
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\/ 


/ \ 
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+ 


V 


/ \ 
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small 


large 
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+ 








8(p'^) = 


- or + 



The first column shows the true correlations r' arranged in order of 
magnitude. The second column expresses the fact that the samphng 
errors on any occasion will probably be arranged in the reverse order of 
magnitude, disregarding their signs. The third column shows the corre- 
lation coefficients measured from their mean. The upper p"s are then 
positive, and the lower negative, and also, what is not shown in the 

* Stanley Wyatt, "The Quantitative Investigation of Higher Mental Processes," 
Brit. Jmrn. Psychol. 1913, VI. p. 131. 
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table, the absolute values increase upwards and downwards from the 
point where the signs change. The fourth (double) column shows the 
probable arrangement of the signs of the quantities e. If the e's are all 
tending to be positive, then the left-hand 
member of the double column gives the 
arrangement, while if the e's all tend to be 
negative, the other member of the double 
column does so. As shown in the last (double) 
column, therefore, the quantities p'e tend 
either to be nearly all negative or nearly all 
positive. For a very small sample the signs 
of p'e will no doubt be quite irregularly 
arranged. But with such a small sample, 
even if p' and e were really uncorrected, it 
would be most unUkely for S (p'e) to be 
negUgible. As the sample increases the signs 
tend to settle down to the above arrange- 
ment, and S (p'e) does not tend to disappear 
compared with S (ee), but only to take on 
one or other of alternative values. It will 
only be zero when all the errors are zero, 
i.e. when no corrections are needed to R'. 
The distribution of S (p'e) about zero in a 
number of samples of the same size will not, 
that is, show a maximum at zero, but a 
minimum, as is shown quahtatively in Fig. 28. 

If, in fact, the actual value of S (p'e) is 
calculated in cases where the true correla- 
tions are known, it is frequently found to be 
greater than the quantities S (ee) which are 
left in the expression. 

The other approximations made in obtain- 
ing the criterion do not appear to be so 
erroneous as this one, though their cumulative 
effect may explain some anomaUes. Leaving 
them on one side let us consider the "cor- 
rectional standard " required by Dr Hart and 

Professor Spearman before they admit any pair of columns. It is this 
correctional standard, combined with the pecuhar distribution of R', 
which chiefly is responsible for the exaggeration of perfection produced 




'Caasnbajj-^ 



182 COEEELATION [pt. ii 

by this criterion, and for the regularity with which an average vahie of 
unity is arrived at. 

Let us examine first the actual distribution of the Hart and Spearman 
R' in a psychological hierarchy, viz. that of Wyatt already referred to, 
and calculate R' not only for those columns which pass the correctional 
standard, but also for other pairs of columns. What we find is that its 
value rises as we descend the hierarchy, rushing asymptotically to 
infinity, remaining for a time imaginary, and then returning. Specimen 
values from Mr Wyatt's hierarchy are given. 

Pairs of columns Values of the Hart and Spearman Criterion 
Analogies and Wordbuilding 0-93 \ 



Completion and Wordbuilding 
Completion and Part-wholes 
Wordbuilding and Part-wholes 
Part-wholes and Memory (delayed) 
Rearranged letters and Missing digits 
Wordbuilding and E R Test 
Sentence construction and Fables ... 
Rearranged letters and E R Test ... 
Nonsense syllables and dissected pictures 
Crossline test and Letter Squares ... 



Passed bv the correctional standard 



0-97 
105 
0-99 
0-92, 
117 
1-26 
1-33 

Practically infinity 
Imaginary 

0'35, a meaningless value, both factors in the 
denominator being now negative 

Expressed in diagrammatic form this and similar calculations lead 
to the conclusion that in actual practice the criterion is distributed as 
in Fig. 29, where the curve is to be understood as a "best fitting" curve 
among the values of R' scattered, with a very considerable dispersion, 
on both sides of it. The hne, in fact, ought to be a broad smudge. 

Now clearly, with a distribution of this sort, it is very important 
that the boundary between the values that are to be rejected and those 
that are to be accepted should be chosen with the greatest care, and not 
arbitrarily but scientifically. Either sound theoretical reasons should 
be given for the choice of the correctional standard, or the choice should 
be based empirically on experiments in material where the truth is 
known a priori, as in the above dice experiments. For obviously, by 
moving this boundary, we can make the final average take on almost 
any value. Another point is that the criterion rushes to infinity at such 
speed that its probable error must be enormous. Dr Hart and Professor 
Spearman, however, give no reasons for their choice of this particular 
standard, upon which depends so much the values they obtain. The 
standard which they thus arbitrarily adopt begins admitting the criteria 
at just such a distance above unity as to balance the cases which give a 
criterion below unity, and entirely explains the remarkable unanimity with 
which this average value unity is obtained by them in their calculations. 
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In other words, the remarkable regularity with which this criterion 
gives the value unity is not a property of the investigated correlation 
coefficients at all, but is a property possessed by the criterion itself, due 
to errors and the action of the "correctional standard." 
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Descending the hierarchy— » 
Fig. 29 

In the writers' opinion the work outhned in this chapter finally 
proves the invaUdity of Professor Spearman's mathematical argument 
in favour of the Theory of Two Factors. If this be so that theory returns 
to the status of a possible, but unproven, theory. 

(5) HIERARCHICAL ORDER THE NATURAL ORDER AMONG 
CORRELATION COEFFICIENTS 
The fact is that hierarchical order, which Professor Spearman was 
the first to notice among correlation coefficients, is the natural relation- 
ship among these coefficients, on any theory whatever of the cause of 
the correlations, excepting only theories specially designed to prevent 
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its occurrence. It is the absence of hierarchical order which would be 
a remarkable phenomenon requiring special explanation; its presence 
requires none beyond what is termed chance. 

An analogy from the simple repeated measurements of a hnear 
magnitude may help to illustrate this. Indeed it is rather more than an 
analogy, being in fact the same phenomenon in its simplest terms and 
dimensions. It is well known that many measurements of the same 
quantity, made with all scientific precautions, under apparently the 
same conditions, and with an avoidance of all known sources of error, 
nevertheless do not give a number of identical values. The values are 
all different, but are not without law and order in their arrangement. 
They are grouped about a centre from which the density decreases in 
both directions, and it is found that this grouping is for most practical 
purposes closely represented by the Normal or Gaussian Curve of Error 
(or one of the more general Pearsonian Curves). 

Experimenters are not surprised to find their data obeying the 
ProbabiUty Law, nor do they require a special theory to explain it. 
On the contrary, it is the departures from this Law which if wide would 
require special investigation, and if confirmed would require a special 
theory. In the same way hierarchical order among correlation coeffi- 
cients should not cause surprise, though any marked variation from this 
order would demand investigation. 

Correlation coefficients are themselves correlated, and n correlation 
coefficients form an w-fold or w-dimensional correlation-surface. The 
particular and convenient form of tabulation of correlation coefficients 
adopted by Professor Spearman and followed by most other psycho- 
logical workers brings to hght, in the form of "hierarchical order," one 
of the properties of this correlation-surface of the correlations. 

In an article entitled "On the Probable Errors of Frequency Con- 
stants and on the Influence of Random Selection on Variation and 
Correlation," in the Phil. Trans. 1898, cxci. A, pp. 229 — 311, Professor 
Pearson and Mr Filon give the following formulae: 

^n.r„ r^ 2 (1 - ri,2) (1 - ris^) 

p _ 1 + ihs - "ruris) {r^ - ^21^14) + (^14 - ^12^24) (^as - 'r^'r^) \ 

'■"'■" 2 (1 - T^i) (1 - T^^) 

so that, as they say, "errors in the correlations of a first organ with a 
second and a third have a correlation themselves of the first order," 
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and "errors in the correlation of two organs and in the correlation of 
a second two have only correlation of the second order." 

Suppose now that the correlations among a number of variates taken 
in pairs are really all the same, and positive, and in a sample let the 
observed value of rgg be the highest observed value : 
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Hi = highest correlation, ifj = second highest, and so on. fe= tendency to be high. 

Then, because of the above theorems of Pearson and Filon, the rows 
and columns x^ and Xg will probably contain more total correlation than 
do the others, and the second highest correlation will probably be in 
one of these. Let it be rgg. Then, after the rows x^ and Xg the row Xg 
.will probably contain many high correlations and rgg will probably be 
the third highest coefScient, because it is a node where two ridges of 
high correlation cross. If it is, then the hierarchy so far is excellent, as 
can be seen on rearranging the square table so as to bring x^, Xg, and Xg 
to the head*. 

Where now is the fourth highest r likely to be found? Somewhere 
among those marked h for high, and sUghtly more probably among the 
r's of the row x^. If to avoid further rearranging we take it to be r^^, 
then j-gi is most probably the fifth, and r^^ the sixth, because they are 
nodes. And so on. In fact it is clear that when the r's are really equal 

* See next page. 



186 COERELATION [ft. ii 

to one another, then sampling the population will give a set of observed 
r's which are arranged not in haphazard, but in hierarchical order. 
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In the experiment described above with cards and dice, where 
hierarchical tendency was found to be produced by group and specific 
factors without any general factor, there was, however, no question of 
sampUng the population. The hierarchical order already appears in the 
theoretical coefficients. There is here, however, another kind of sampling 
present, viz. sampling of the elements which make up the variates. 

Let us suppose, instead of deciding the numbers of groups and specific 
elements as we did by drawing cards, we had dipped into an infinite bag 
containing black balls and white balls, the former representing group 
factors and the latter specific factors. Then the most probable event 
would have been that the proportion of group factors and specific 
factors in each variate would have been the same as the proportion in 
which the balls occurred in the bag. If, in addition, we assume the samples 
drawn to be all the same size, then the most probable result of the whole 
experiment would have been obtaining all the correlations equal. 

That they do not come out equal may be regarded as due to the 
sampling. The samples vary in size, the proportion of group factors 
varies from sample to sample, and the distribution of the individual 
group factors among the several variates departs from the most probable 
distribution. From this point of view the departure of the correlation 
coefficients from equality is due to errors of sampling, and the applica- 
tion of the theorem of Pearson and Filon would lead us to expect what 
as we have seen does actually occur, namely, hierarchical order. 
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The experiment with the bag of balls would as a matter of fact not 
produce as great a departure from equahty of correlation coefficients 
as is found in practice in experimental psychology, or as was found in 
our form of the experiment with card drawing. This in the latter case 
is because the cards give a greater variation of the proportion of group 
to specific factors than would be found with the bag of balls. And this 
is also the case in mental tests, which are not chance samples of the 
mental elements, but are carefully chosen so as to measure different 
kinds of activity. 

This apphcation of Pearson and Filon's theorem (which contemplates 
only sampling errors in the ordinary sense) to the changes in correlation 
produced by samphng the underlying elements, is no doubt somewhat 
novel, and may appear to be a difficulty. 

Further consideration leads to the following resolution of the 
difficulty. 

Suppose that n variates (in our work the scores in mental tests) 
are so connected by factors that the correlations are all equal and 
positive. Then let a small sample of the population be taken. The 
observed correlations will show departures from equahty, and will be 
found to be in hierarchical order. This hierarchical order is due to 
samphng the population. 

Now consider why the correlations do not come out at their true 
values. They give of course the true values /or the sample. The reason 
of their departing from the true values of the whole population is that 
(a) some of the factors which really are links between the variates (the 
mental activities) happen to have remained steadier than usual during 
the sample. In the hmit a factor might happen to retain exactly the 
same value through the various individuals of the sample. That is, 
some of the hnking factors do not in reahty come into action, or not in 
their full force; (6) on the other hand, some factors which are really 
different and unconnected may happen by chance to rise and fall 
together, through the sample, and more or less to act as one. That is, 
fictitious linking factors are created, which would disappear with a 
larger sample. 

Clearly therefore a hierarchy of correlation coefficients, caused by 
samphng the population, is due to chance having caused a change in 
the apparent factors acting. It follows that if we make a real change 
in the factors acting, we shall get a hierarchy, and this is what we do 
when we choose the mental tests to be employed in any research. Each 
mental test is a test of a sample of abihties. 
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The laws governing the correlation of correlation coefficients which 
vary because of sampling the population can, in fact, be applied without 
hesitation to the relationships between " true " correlations in the whole 
of any population simply because any such population is itself a sample. 
Enghsh grammar school boys of 12 are themselves a sample of a larger 
boyhood ; the whole human race indeed is a sample of " what might have 
been," selected by the struggle for survival. 

The whole question clearly has philosophical bearings on the degree 
of reahty of causal connections ; for on this view those chance Unks in 
a small sample which were a few paragraphs ago termed "fictitious hnks, 
which would disappear with a larger sample," do not differ except in 
degree from the "real" causal hnks which we only term real because 
they persist throughout the largest sample with which we are acquainted. 

In another direction there are connections with the difierence, which 
is one of degree only, between what is called "partial" correlation and 
"entire" correlation*. 

The conclusion to be drawn is that hierarchical order is the natural 
order to expect among correlation coefficients, on a theory of chance 
samphng alone, and that therefore, by the principle of Occam's razor, its 
presence cannot be made the criterion of the existence of any special form 
of causal connection, such as is assumed in the Theory of Two Factors. 

(6) THOMSON'S SAMPLING THEORY OP ABILITYf 
In place therefore of the two factors of that theory, one General 
and the other Specific, Thomson prefers to think of a number of factors 
at play in the carrying out of any activity such as a mental test, these 
factors being a sample of all those which the individual has at his com- 
mand. 

The first reason for preferring this theory is that of Occam's razor. 
It makes fewer assumptions than does the more special form of theory. 
It does not deny General Ability, for if the samples are large there will 
of course be factors common to all activities. On the other hand it does 
not assert General Abihty, for the samples may not be so large as this, 
and no single factor may occur in every activity. If moreover a number 
of factors do run through the whole gamut of activities, forming a 
General Factor, this group need not be the same in every individual. 

* See Karl Pearson, "On the Influence of Natural Selection on the Variability and 
Correlation of Organs," Phil. Trans. Soy. 3oc. London, 1902, oo. A, pp. 1 — 66; Godfrey 
H. Thomson, "The Proof or Disproof of the Existence of General Ability,' Brit. Journ. 
Psychol. 1919, ix. pp. 321—336. 

t Psychol. Review, 1920, xxvii. p. 183. 
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In other words General Ability, if possessed by any individual, need 
not be psychologically of the same nature as any General AbiUty 
possessed by another individual. .Everyone has probably known men 
who were good all round, but Jones may be a good all round man for 
different reasons from those which make Smith good all round. 

The SampUng Theory, then, neither denies nor asserts General 
Ability, though it says it is unproven. Nor does it deny Specific Factors. 
On the other hand it does deny the absence of Group Factors. It is this 
absence of Group Factors which is in truth the crux of Professor 
Spearman's theory, which is not so much a theory of general ability, 
or a theory of two factors, as a Theory of the Absence of Group Factors. 
And inasmuch as its own disciples have begun to require Group Factors 
to explain their data, its distinguishing mark would appear in any case 
to be disappearing. 

Such Group Factors as are admitted by Professor Spearman are of 
very narrow range, and are mutually exclusive, that is they do not 
overlap. Both these points follow from the sentence used in the 1912 
article with Dr Hart, where it is said that, in the case of performances 
too alike, "when this hkeness is diminished, or when the resembhng 
performances are pooled together, a point is soon reached where the 
correlations are still of considerable magnitude, but now indicate no 
common factor except the General one." 

Since this point is soon reached, the Group Factors must be narrow 
in range. Since poohng a few performances will obhterate any Group 
Factors, they must be exclusive of one another. For if A, B, C and D 
are four tests, in which A and B have a Group Factor common to them, 
and C and D another, then of course by poohng A with B and also C 
with D we can obtain two pools AB and CD which have no fink. But 
a A, B and C have one Group Factor, and C and D have another, then 
these Group Factors cannot be separated into Specific Factors. In fact, 
a Specific Factojjs^a separated Group Factor, and Professor Spearman's 
theory asserts that Group Factors, if any, are separable and mutually 
exclusive. This is a great stumbUng-block in the way of the acceptance 
of the Theory of Two Factors, unless perhaps "Specific Factor" is 
interpreted in the way suggested later. 

It is a fact which will be admitted by most that the same activity 
is not performed in the same way by different individuals, even though 
they are equally expert. Not only are Specific Factors therefore required 
by this theory for every separate activity, excluding only any which 
are very closely similar, but also Specific Factors of different psycho- 
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logical natures are required for each individual. Further, the same 
individual does not always perform the same activity in the same way. 
A man using an ergograph will, as he tires, begin to employ muscles 
other than those naturally used at the outset. When we are returning 
from a cycle ride muscles are used in a different manner from the style 
adopted at the start, indeed sometimes dehberate changes are made to 
give rehef . And in the same way a mental task is performed by different 
methods at different times. Does this then mean a different Specific 
Factor for each way of doing a task? All these difficulties appear to 
argue against the Theory of Two Factors, and seem to be considerably 
cleared up by the SampHng Theory. 

Finally, the Samphng Theory appears to be in accordance with a 
Une of thought which has already proved fruitful in other sciences. 
Any individual is, on the Mendehan theory, a sample of unit quahties 
derived from his parents, and of these a further sample is apparent 
and exphcit in the individual, the balance being dormant but capable 
of contributing to the sample which is to form his child. It seems a 
natural step further to look upon any activity carried out by this indi- 
vidual as involving a further sample of these quahties. 

(7) THE DIFFICULTY OP "TRANSFER OP TRAINING" 

Although Professor Spearman's Theory of Two Factors has been 
chiefly based by him on the hne of argument which, it is suggested, 
has now been proved invahd, viz. the "hierarchy" argument, yet there 
is another and powerful form of reasoning which can be brought to 
its support, based upon the fact that, according to some experimenters, 
improvement in any activity due to training does not transfer in any 
appreciable amount to any other activity, except to those very similar 
indeed to the trained activity. And even those workers who do not 
agree that this is an experimental fact are usually content to take a 
defensive attitude and say that transfer is not disproved. Few if any 
will say that it is proved. 

This certainly seems to point to the absence of Group Factors, and 
to support Professor Spearman's theory, which only needs to add to 
itself the assumption that the Specific Factors are, while the General 
Factor is not, capable of being improved by training, to fit the case 
admirably. Of course, if transfer really occurs, the argument proves 
the opposite. And although psychological experiment points on the 
whole to the absence or the narrowness of transfer, yet popular opinion 
among business men, schoolmasters, and others, is in favour of transfer 
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to a considerable extent. Assuming no transfer, however, how can the 
Sampling Theory, with its numerous Group Factors, explain this? 

It is necessary to assume that the Group Factors are all unim- 
provable or only slightly improvable by training, though they may 
change with the growth and development of the individual. The 
improvement which certainly takes place when we practise any activity 
is due, it may then be assumed, not to improvement in the elemental 
abihties which form the sample, hut to a weeding out, and selection of 
these. The sample alters, mainly no doubt is diminished, though addi- 
tions are klso conceivable. It becomes a more economical sample, and 
waste of effort in using elements which are unnecessary is avoided. 
Improvement in any mental activity may on this view be compared 
with improvement in a manual dexterity, in which it is notorious that 
the improvement consists largely in the avoidance of unnecessary 
movements. 

When another activity is then attempted, the elemental factors are 
just the same as they would have been had the practice in the first 
activity not taken place. The new activity will be performed by a new 
group of factors, which sample will as in the first case be in the beginning 
wasteful and will include many unnecessary elements. Transfer of 
improvement gained in the first activity will therefore not take place 
except in so far as the second activity is recognised as a mere variant of 
the original one, in which case the weeding out process which has taken 
place in the first case may be done at the very first attempt, at any rate 
to some extent. 

To use another analogy, the improvement which takes place when 
a football team practises playing together for a series of matches is due 
more to team work than to individual improvement. A new team, even 
though it contain a large proportion of players from the first team, will not 
have this unity of action. There will be little transfer of improvement. 

According to the view here developed, it is the weeding out of the 
sample of elemental abilities which is specific. The team work is specific, 
though the players play for several clubs. This would appear to enable 
a reconciliation to be effected between the almost universal belief in 
" types " of abiUty (to which Professor Spearman refers) and the experi- 
mental facts concerning both correlation and transfer. If there be a 
General Factor at all, it might be the power to shake flown rapidly into 
good team work, in a word, educabihty. But there seems no objection 
to assuming that this, instead of being a General Factor, is a property 
of each elemental factor, varying from factor to factor. 
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To sum up this section : if transfer of training really does not occur 
to any great extent, then it has to be admitted that the Theory of Two 
Factors readily explains this. But the Sampling Theory can also do so, 
in a manner which is perhaps not so easy to set forth, but which never- 
theless appears to be more illuminating and less artificial than the 
alternative theory. 

(8) CONCLUSIONS 

Professor Spearman's Theory of Two Factors, which assumes that 
ability in any performance is due to (a) a General Factor and (b) a Specific 
Factor (Group Factors being absent, or at any rate very narrow in 
range and mutually exclusive), is based chiefly on the observed fact that 
correlation coefficients in psychological tests tend to fall into "hierar- 
chical order." It has been shown, however, that the criterion adopted 
for evaluating the degree of perfection of hierarchical order present is 
untrustworthy and has led to over-estimation. Such hierarchical order 
as is actually present is in fact the natural thing to expect, and it is the 
absence of such which should occasion surprise. The proof of the Theory 
of Two Factors which is based on the presence of hierarchical order 
therefore falls to the ground. The theory remains a possible explanation 
of the facts but ceases to be the unique explanation. As an alternative 
theory Thomson has advanced a Samphng Theory of Abihty, in which 
any performance is considered as being carried out by a sample of 
Group Factors. This theory is preferred because it makes fewer and less 
special assumptions, because it is more elastic and wider, and because 
it is in closer accord with theories in use in biology and in the study of 
heredity *- 

* For articles on the General Ability controversy which have appeared while this 
book was going through the press the reader is referred to the conclusion of the Biblio- 
graphy, p. 212. 
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1. Fechner's Fundamental Table. 

2. Urban's Tables for the Constant Process. 

3. Table of Muller-Urban Weights. 

4. Reciprocals of pq, where p + q = 1. 

5. Rich's Checking Table for the Constant Process. 

1. Fechner's Fundamental Table. 
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0-87 


0-7965 


0-54 


00710 


0-71 


0-3913 


0-88 


0-8308 


0-55 


0-0888 


0-72 


0-4121 


0-89 


0-8673 


0-56 


0-1067 


0-73 


0-4333 


0-90 


0-9062 


0-57 


0-1247 


0-74 


0-4549 


0-91 


0-9480 


0-58 


0-1427 


0-75 


0-4769 


0-92 


0-9935 


0-59 


01609 


0-76 


0-4994 


0-93 


10435 


0-60 


0-1792 


0-77 


0-5224 


0-94 


1-0993 


0-61 


0-1975 


0-78 


0-5460 


0-95 


1-1630 


062 


0-2160 


0-79 


0-5702 


0-96 


1-2380 


0-63 


0-2346 


0-80 


0-5951 


0-97 


1-3300 


0-64 


0-2535 


0-81 


0-6208 


0-98 


1-4520 


0-65 


0-2725 


0-82 


0-6473 


0-99 


1-6450 


0-66 


0-2916 


0-83 


0-6747 


1-00 


00 



If p is < 0-5, look in the Table not for f but for \ — f, and take 
y negative. Thus the y f or ^ = 0-25 is — 0-4769. 

This Table is less generally useful than Sheppard's Tables of the 
Probability Integral, Tables I and II of Pearson's Tables for Statisticians 
and Biometricians (Cambridge University Press). Table I there differs 
from this only by a factor v'2 : but gives many more values and to more 
decimal places. 
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2. Urban' s Tables for the Constant Process. 
From Archivf. d. ges. Psychol 1912, xxiv. 240 — 241*. 



p 


iw 


jyW 


2W 


22 Jf 


2yW 


3 W 


3^W 


3yW 


iW 


42 Jf 


47!^ 


0-50 


10000 


0-0000 


2-0000 


4-0000 


0-0000 


3-0000 


9-0000 


0-0000 


4-0000 


16-0000 


0-0000 


0-51 


0-9998 


0-0177 


1-9996 


3-9991 


0-0354 


2-9993 


8-9980 


0-0531 


3-9991 


15-9965 


0-0708 


0-52 


0-9991 


0-0355 


1-9982 


3-9963 


0-0709 


2-9972 


8-9917 


0-1064 


3-9963 


15-9853 


0-1419 


0-53 


0-9980 


0-0531 


1-9959 


3-9918 


0-1062 


2-9938 


8-9816 


0-1593 


3-9918 


15-9672 


0-2124 


0-54 


0-9964 


0-0707 


1-9928 


3-9855 


0-1415 


2-9891 


8-9674 


0-2122 


3-9855 


15-9421 


0-2830 


0-55 


0-9943 


0-0883 


1-9886 


3-9772 


0-1766 


2-9829 


8-9487 


0-2649 


3-9772 


15-9088 


0-3532 


0-56 


0-9918 


0-1058 


1-9836 


3-9671 


0-2116 


2-9753 


8-9260 


0-3175 


3-9671 


15-8685 


0-4233 


0-57 


0-9888 


0-1233 


1-9776 


3-9551 


0-2466 


2-9663 


8-8990 


0-3699 


3-9551 


15-8205 


0-4932 


0-58 


0-9853 


0-1406 


1-9706 


3-9413 


0-2812 


2-9560 


8-8679 


0-4218 


3-9413 


15-7651 


0-5624 


0-59 


0-9814 


0-1579 


1-9627 


3-9254 


0-3158 


2-9441 


8-8322 


0-4737 


3-9254 


15-7018 


0-6316 


0-60 


0-9768 


0-1750 


1-9537 


3-9074 


0-3501 


2-9306 


8-7916 


0-5252 


3-9074 


15-6296 


0-7002 


0-61 


0-9720 


0-1920 


1-9440 


3-8881 


0-3839 


2-9161 


8-7482 


0-5759 


3-8881 


15-5523 


0-7679 


0-62 


0-9666 


0-2088 


1-9332 


3-8663 


0-4176 


2-8997 


8-6992 


0-6263 


3-8663 


15-4653 


0-8351 


0-63 


0-9607 


0-2254 


1-9214 


3-8429 


0-4508 


2-8822 


8-6465 


0-6762 


3-8429 


15-3715 


0-9015 


0-64 


0-9542 


0-2419 


1-9084 


3-8168 


0-4838 


2-8626 


8-5878 


0-7257 


3-8168 


15-2672 


0-9676 


065 


0-9473 


0-2581 


1-8945 


3-7890 


0-5163 


2-8418 


8-5253 


0-7744 


3-7890 


15-1562 


1-0325 


0-66 


0-9398 


0-2741 


1-8797 


3-7594 


0-5481 


2-8196 


8-4586 


0-8222 


3-7594 


15-0376 


1-0962 


0-67 


0-9317 


0-2899 


1-8634 


3-7268 


0-5797 


2-7951 


8-3853 


0-8696 


3-7268 


14-9072 


1-1594 


0-68 


0-9232 


0-3053 


1-8464 


3-6929 


0-6106 


2-7697 


8-3090 


0-9159 


3-6929 


14-7715 


1-2212 


0-69 


0-9140 


0-3205 


1-8280 


3-6561 


0-6409 


2-7421 


8-2262 


0-9614 


3-6561 


14-6243 


1-2818 


0-70 


0-9043 


0-3353 


1-8085 


3-6170 


0-6706 


2-7128 


8-1383 


1-0059 


3-6170 


14-4682 


1-3412 


0-71 


0-8939 


0-3498 


1-7878 


3-5755 


0-6996 


2-6816 


8-0449 


1-0493 


3-5755 


14-3021 


1-3991 


0-72 


0-8830 


0-3639 


17659 


3-5318 


0-7277 


2-6489 


7-9466 


1-0916 


3-5318 


14-1274 


1-4555 


0-73 


0-8713 


0-3775 


1-7426 


3-4852 


0-7541 


2-6139 


7-8417 


1-1326 


3-4852 


13-9408 


1-5101 


0-74 


0-8590 


0-3908 


1-7180 


3-4360 


0-7815 


2-5770 


7-7310 


1-1723 


3-4360 


13-7440 


1-5630 


0-75 


0-8460 


0-4035 


1-6921 


3-3842 


0-8070 


2-5381 


7-6144 


1-2104 


3-3842 


13-5366 


1-6139 


0-76 


0-8323 


0-4157 


1-6646 


3-3293 


0-8313 


2-4970 


7-4909 


1-2470 


3-3293 


13-3171 


1-6626 


0-77 


0-8179 


0-4273 


1-6357 


3-2714 


0-8545 


2-4536 


7-3607 


1-2818 


3-2714 


13-0858 


1-7090 


0-78 


0-8025 


0-4382 


1-6051 


3-2102 


0-8764 


2-4076 


7-2229 


1-3146 


3-2102 


12-8406 


1-7527 


0-79 


0-7865 


0-4484 


1-5729 


3-1459 


0-8969 


2-3594 


7-0782 


1-3453 


3-1459 


12-5835 


1-7938 


0-80 


0-7695 


0-4579 


1-5390 


30780 


0-9159 


2-3085 


6-9255 


1-3738 


3-0780 


12-3120 


1-8317 


0-81 


0-7515 


0-4665 


1-5031 


3-0061 


0-9331 


2-2546 


6-7638 


1-3996 


3-0061 


12-0245 


1-8662 


0-82 


0-7327 


0-4743 


1-4653 


2-9307 


0-9485 


2-1980 


6-5940 


1-4228 


2-9307 


11-7227 


1-8970 


0-83 


0-7129 


0-4810 


1-4257 


2-8515 


0-9619 


2-1386 


6-4158 


1-4429 


2-8515 


11-4059 


1-9239 


0-84 


0-6921 


0-4866 


1-3842 


2-7683 


0-9732 


2-0762 


6-2287 


1-4598 


2-7683 


11-0733 


1-9464 


0-85 


0-6697 


0-4908 


1-3394 


2-6788 


0-9816 


20091 


6-0273 


1-4725 


2-6788 


10-7152 


1-9633 


0-86 


0-6463 


0-4937 


1-2927 


2-5853 


0-9875 


1-9390 


5-8170 


1-4812 


2-5853 


10-3413 


1-9749 


0-87 


0-6215 


0-4950 


1-2430 


2-4860 


0-9900 


1-8645 


5-5935 


1-4851 


2-4860 


9-9440 


1-9801 


0-88 


0-5953 


0-4946 


1-1907 


2-3813 


0-9892 


1-7860 


5-3580 


1-4838 


2-3813 


9-5253 


1-9784 


0-89 


0-5673 


0-4920 


1-1346 


2-2692 


0-9840 


1-7019 


5-1056 


1-4760 


2-2692 


9-0766 


1-9680 


0-90 


0-5376 


0-4871 


1-0751 


2-1502 


0-9743 


1-6126 


4-8380 


1-4614 


2-1502 


8-6008 


1-9485 


091 


0-5059 


0-4796 


1-0118 


2-0236 


0-9592 


1-5177 


4-5531 


1-4388 


2-0236 


8-0944 


1-9184 


092 


0-4718 


0-4687 


0-9435 


1-8871 


0-9374 


1-4153 


4-2459 


1-4061 


1-8871 


7-5483 


1-8748 


0-93 


0-4351 


0-4540 


0-8702 


1-7403 


0-9080 


1-3052 


3-9157 


1-3620 


1-7403 


6-9613 


1-8160 


0-94 


0-3954 


0-4346 


0-7907 


1-5814 


0-8692 


1-1861 


3-5582 


1-3039 


1-5814 


6-3258 


1-7385 


0-95 


0-3519 


0-4093 


0-7038 


1-4076 


0-8185 


1-0557 


3-1671 


1-2278 


1-4076 


5-6304 


1-6370 


0-98 


0-3036 


0-3759 


0-6073 


1-2146 


0-7518 


0-9109 


2-7328 


1-1277 


1-2146 


4-8582 


1-5036 


0-97 


0-2469 


0-3282 


0-4936 


0-9871 


0-6564 


0-7403 


2-2210 


0-9847 


0-9871 


3-9485 


1-3129 


0-98 


0-1881 


0-2732 


0-3762 


0-7525 


0-5463 


0-5644 


1-6931 


0-8195 


0-7525 


3-0099 


1-0926 


0-99 


0-1127 


0-1854 


0-2254 


0-4508 


0-3708 


0-3381 


1-0142 


0-5561 


0-4508 


1-8030 


0-7415 



* With three corrections, two of which were made by Urban in the Praxis der Kon- 
sitmzmethode, and the third by Rich in Amer. Journ. Psychol. 1918, xxix. 121. The last 
was also rediscovered by the Cambridge Univ. Press proof reader. 
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2. Urban' s Tables for the Constant Process (contd.). 

From Archivf. d. ges. Psychol. 1912, xxiv. 240—241. 



p 


5W 


5'W 


5yW 


6W 


Q^W 


6yW 


TW 


1'W 


TyW 


0-50 


5-0000 


25-0000 


0-0000 


6-0000 


36-0000 


0-0000 


70000 


49-0000 


0-0000 


0-51 


4-9989 


24-9945 


0-0885 


5-9987 


35-9921 


0-1062 


6-9985 


48-9892 


0-1239 


0-52 


4-9954 


24-9770 


0-1773 


5-9945 


35-9669 


0-2128 


6-9936 


48-9549 


0-2483 


0-53 


4-9898 


24-9488 


0-2655 


5-9877 


35-9262 


0-3185 


6-9856 


48-8996 


0-3716 


0-54 


4-9819 


24-9095 


0-3537 


5-9783 


35-8697 


0-4251 


6-9747 


48-8266 


0-4960 


0-55 


4-9715 


24-8575 


0-4415 


5-9658 


35-7948 


0-5298 


6-9601 


48-7207 


0-6181 


0-56 


4-9589 


24-7945 


0-5291 


5-9507 


35-7041 


0-6349 


6-9425 


48-5972 


0-7408 


0-57 


4-9439 


24-7195 


0-6165 


5-9327 


35-5961 


0-7398 


6-9215 


48-4502 


0-8631 


0-58 


4-9266 


24-6330 


0-7030 


5-9119 


35-4715 


0-8436 


6-8972 


48-2807 


0-9842 


0-59 


4-9068 


24-5340 


0-7895 


5-8882 


35-3290 


0-9474 


6-8695 


48-0866 


1-1053 


0-60 


4-8842 


24-4212 


0-8753 


5-8611 


35-1666 


1-0503 


6-8380 


47-8656 


1-2254 


0-61 


4-8601 


24-3005 


0-9599 


5-8321 


34-9927 


1-1518 


6-8041 


47-6290 


1-3438 


0-62 


4-8329 


24-1645 


1-0439 


5-7995 


34-7969 


1-2527 


6-7661 


47-3624 


1-4615 


0-63 


4-8036 


24-0180 


1-1269 


5-7643 


34-5859 


1-3523 


6-7250 


47-0753 


1-5777 


0-64 


4-7710 


23-8550 


1-2094 


5-7252 


34-3512 


1-4513 


6-6794 


46-7558 


1-6932 


0-65 


4-7363 


23-6815 


1-2906 


5-6836 


34-1014 


1-5488 


6-6308 


46-4157 


1-8069 


0-66 


4-6992 


23-4962 


1-3703 


5-6391 


33-8346 


1-6444 


6-5790 


46-0526 


1-9184 


0-67 


4-6585 


23-2925 


1-4493 


5-5902 


33-5412 


1-7391 


6-5219 


45-6533 


2-0290 


0-68 


4-6161 


23-0805 


1-5265 


5-5393 


33-2359 


1-8319 


6-4625 


45-2378 


2-1372 


0-69 


4-5701 


22-8505 


1-6023 


5-4841 


32-9047 


1-9227 


6-3981 


44-7870 


2-2432 


0-70 


4-5213 


22-6065 


1-6765 


5-4256 


32-5534 


2-0118 


6-3298 


44-3087 


2-3471 


0-71 


4-4694 


22-3470 


1-7489 


5-3633 


321797 


2-0987 


6-2572 


43-8001 


2-4484 


0-72 


4-4148 


22-0740 


1-8193 


5-2978 


31-7866 


21832 


6-1807 


43-2650 


2-5471 


0-73 


4-3565 


21-7825 


1-8877 


5-2278 


31-3668 


2-2652 


6-0991 


42-6937 


2-6427 


0-74, 


4-2950 


21-4750 


1-9538 


5-1540 


30-9240 


2-3446 


6-0130 


42-0910 


2-7353 


0-75 


4-2302 


21-1510 


2-0174 


5-0762 


30-4574 


2-4209 


5-9223 


41-4560 ' 


2-8243 


0-76 


4-1616 


20-8080 


2-0783 


4-9939 


29-9635 


2-4940 


5-8262 


40-7837 


2-9096 


0-77 


4-0893 


20-4465 


21363 


4-9072 


29-4430 


2-5635 


5-7250 


40-0751 


2-9908 


0-78 


4-0127 


20-0635 


2-1909 


4-8152 


28-8914 


2-6291 


5-6178 


39-3245 


3-0673 


0-79 


3-9324 


19-6618 


2-2422 


4-7188 


28-3129 


2-6907 


5-5053 


38-5370 


3-1391 


0-80 


3-8475 


19-2375 


2-2896 


4-6170 


27-7020 


2-7476 


5-3865 


37-7055 


3-2055 


0-81 


3-7576 


18-7882 


2-3327 


4-5092 


27-0551 


2-7993 


5-2607 


36-8250 


3-2658 


0-82 


3-6634 


18-3168 


2-3713 


4-3960 


26-3761 


2-8455 


5-1287 


35-9008 


3-3198 


0-83 


3-5644 


17-8218 


2-4049 


4-2772 


25-6631 


2-8858 


4-9901 


34-9306 


3-3668 


0-84 


3-4604 


17-3020 


2-4330 


41525 


24-9149 


2-9196 


4-8447 


33-9119 


3-4062 


0-85 


3-3485 


16-7425 


2-4541 


4-0182 


24-1092 


2-9449 


4-6879 


32-8153 


3-4358 


0-86 


3-2316 


161582 


2-4687 


3-8780 


23-2679 


2-9624 


4-5243 


31-6702 


3-4561 


0-87 


3-1075 


15-5375 


2-4751 


3-7290 


22-3740 


2-9701 


4-3505 


30-4535 


3-4652 


0-88 


2-9766 


14-8832 


2-4730 


3-5720 


21-4319 


2-9676 


4-1673 


29-1712 


3-4622 


0-89 


2-8364 


14-1822 


2-4600 


3-4037 


20-4222 


2-9521 


3-9710 


27-7972 


3-4441 


0-90 


2-6878 


13-4388 


2-4356 


3-2253 


19-3518 


2-9228 


3-7628 


26-3400 


3-4099 


0-91 


2-5295 


12-6475 


2-3980 


3-0354 


18-2124 


2-8776 


3-5413 


24-7891 


3-3572 


0-92 


2-3588 


11-7942 


2-3435 


2-8306 


16-9837 


2-8122 


3-3024 


23-1167 


3-2809 


0-93 


2-1754 


10-8770 


2-2700 


2-6105 


15-6629 


2-7240 


3-0456 


21-3189 


3-1780 


0-94 


1-9768 


9-8840 


2-1731 


2-3722 


14-2330 


2-6077 


2-7675 


19-3726 


3-0423 


0-95 


1-7595 


8-7975 


2-0463 


2-1114 


12-6684 


2-4556 


2-4633 


17-2431 


2-8648 


0-96 


1-5182 


7-5910 


1-8795 


1-8218 


10-9310 


2-2554 


2-1255 


14-8784 


2-6313 


0-97 


]-2339 


6-1695 


1-6411 


1-4807 


8-8841 


1-9693 


1-7275 


12-0922 


2-2975 


0-98 


0-9406 


4-7030 


1-3658 


1-1287 


6-7723 


1-6389 


1-3168 


9-2179 


1-9121 


0-99 


0-5634 


2-7172 


0-9269 


0-6761 


3-9568 


1-1123 


0-7888 


5-4218 


1-2976 
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3. Table of Muller-Urban Weights*. 



p 


W 


P 


W 


P 


W 


0-60 


1-000 


0-67 


0-932 


0-84 


0-694 


0-51 


1000 


0-68 


0-923 


0-85 


0-670 


0-52 


0-999 


0-69 


0-914 


0-86 


0-646 


0-53 


0-998 


0-70 


0-904 


0-87 


0-621 


0-54 


0-996 


0-71 


0-894 


0-88 


0-595 


0-55 


0-995 


0-72 


0-883 


0-89 


0-567 


0-56 


0-992 


0-73 


0-871 


0-90 


0-538 


0-57 


0-989 


0-74 


0-859 


0-91 


0-506 


0-58 


0-985 


0-75 


0-846 


0-92 


0-472 


0-59 


0-981 


0-76 


0-832 


0-93 


0-435 


0-60 


0-977 


0-77 


0-818 


0-94 


0-396 


0-61 


0-972 


0-78 


0-803 


0-95 


0-352 


0-62 


0-967 


0-79 


0-787 


0-96 


0-304 


0-63 


0-960 


0-80 


0-770 


0-97 


0-249 


0-64 


0-954 


0-81 


0-752 


0-98 


0187 


0-65 


0-947 


0-82 


0-733 


0-99 


0112 


0-66 


0-940 


0-83 


0-713 


1-00 


0000 



The weight of a ^ which is less than 0-5 is the same as the weight of 
a p which exceeds 0-5 by the same amount. Thus the weights oip = 0-25 
and oi p = 0-75 are both alike, = 0-846. 

* The table is quoted from F. M. Urban, "The Method of Constant Stimuli and its 
Generalisations," Psychological Beview, 1910, xvn. p. 253. See also "Die psychophyslschen 
Massmethoden als Grundlagen empirischer Messungen," by the same author, Archiv f. d. 
gea. Psychologie, xv. and xti. 

4. Reciprocals of pq, where p + q = 1. 





1 




1 




1 


p 01 q 


pq 


p OT q 


pq 


p 01 q 


pq 


■50 


4-0 


-67 


4^5 


-84 


7-5 


-51 


4-0 


-68 


4^6 


•85 


7-9 


-52 


4-0 


-69 


4^7 


•86 


83 


-53 


4-0 


-70 


4-8 


•87 


8-8 


-54 


4-0 


-71 


4-9 


•88 


9-4 


-55 


4-1 


-72 


5-0 


•89 


10-2 


-56 


4-1 


-73 


5-1 


•90 


11-1 


-57 


4-1 


-74 


6-2 


•91 


12-2 


-58 


4-1 


-75 


5-3 


•92 


13-6 


•59 


4-1 


-76 


5-4 


•93 


15-4 


•60 


4-2 


•77 


5-6 


•94 


17-7 


-61 


4-2 


•78 


5-8 


•95 


21-0 


-62 


4-3 


•79 


6-0 


■96 


26-0 


-63 


43 


•80 


6-2 


•97 


34-4 


-64 


4.4 


•81 


6-5 


•98 


51-0 


•65 


4.4 


•82 


6-8 


•99 


101 


•66 


4-5 


•83 


7-1 


bOO 


00 
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5. Rich's Checking Table for the Constant Method. 

Published from the Laboratory of Cornell University, in 
Amer. Journ. Psychol. 1918, xxix. 120. 

An example will best explain the use of tliis Table. Consider the 

example worked on p. 73. If we form for each hne the totals of the 

five quantities 

W + yW + sW + sW + syW 
we get the results 



s 


P 


Totals 


6 


•00 


•0000 


5 


•10 


13^2371 


4 


•14 


9^8835 


3 


•40 


7^1880 


2 


•65 


2^5836 





•80 


1^2274 


2 


•87 


5^8355 


4 


•96 


^•2559 


6 


100 


00000 



482110 

These totals however will all be found in Rich's Table entered with 
the proper s (Rich's x) and f, and can thus be checked. The grand total 
48'2110 ought to agree with the sum of the last row of the table (4) on 
p. 73, i.e. with 

4-8026 + 0-4311 - 7-6406 + 43-7049 + 6-9130. 
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p 


x = l 


.r = 2 


x = 3 


x=i 


x=5 


x = 6 


x = l 


•01 


- 0327 


-2327 


-7235 


1-4396 


2-2810 


3-4479 


4-8403 


■02 


•0179 


-4973 


1-3529 


2-5847 


4-1927 


6-1770 


8-5375 


■03 


•0843 


-7430 


1-8953 


3-5414 


5-6810 


8-3142 


11-4409 


•04 


■1590 


-9978 


2-4437 


4-4969 


7-1674 


10-4251 


14-3003 


•05 


■2371 


1-2355 


2-9376 


5-3436 


8-4633 


12-2668 


16-7842 


•06 


■3170 


1-4637 


3-4012 


6-1295 


9-6486 


13-9583 


190586 


•07 


■3973 


1-6836 


3-8400 


6-8667 


10-7635 


15-5305 


21-1676 . 


•08 


■4780 


1-8963 


4-2582 


7-6637 


11-8126 


17-0052 


23-1413 


■09 


■5585 


2-1025 


4-6583 


8-2259 


12-8063 


18-3965 


24-9995 


■10 


■6386 


2-3015 


5-0397 


8-8530 


13-7415 


19-7048 


26-7434 


■11 


■7179 


2-4951 


5-4068 


9-4531 


14-6339 


20-9491 


28-3994 


■12 


■7967 


2-6835 


5-7609 


10-0289 


15-4875 


221370 


29-9770 


■13 


■8745 


2-8655 


6-0994 


10-5764 


16-2964 


23-2594 


31-4663 


■14 


■9515 


3-0431 


6-4274 


11-1043 


17^0737 


24-3361 


32-8910 


■15 


1^0275 


3-2155 


6-7428 


11-6096 


17^8158 


25-3614 


34-2463 


■16 


11031 


3-3848 


7-0506 


12-1007 


18^6349 


26-3533 


35-5659 


■17 


1-1767 


3-5472 


7-3434 


12-5654 


19^2132 


27-2864 


36-7858 


■18 


1-2495 


3-7059 


7^6276 


13-0148 


19^8673 


28-1860 


37-9681 


■19 


13215 


3-8611 


7-9038 


13-4494 


20-4981 


29-0500 


39-1049 


■20 


13927 


40127 


8-1718 


13-8699 


21-1070 


29-8830 


40-1981 


■21 


1^4627 


41600 


8-4304 


14-2737 


21-6901 


30-6791 


41-2413 


■22 


15311 


4-3032 


8-6802 


14-6624 


22-2496 


31-4418 


42-2393 


■23 


1-5991 


4^4432 


8-9231 


15-0388 


22-7901 


32-1773 


43-1999 


■24 


1^6655 


4^6792 


9-1575 


15-4004 


23-3079 


32-8800 


44-1169 


■25 


1^7310 


4^7118 


9-3846 


15-7494 


23-8063 


33-5562 


44-9965 


■26 


1^7954 


4-8407 


9-6039 


16-0852 


24-2844 


34-2016 


46^8369 


■27 


1^8589 


4-9675 


9-8168 


16-4097 


24-7451 


34-8232 


46^6439 


■28 


1-9212 


5-0891 


10-0230 


16-7228 


26-1886 


35-4203 


47^4177 


■29 


1-9821 


5-2078 


10-2213 


17-0226 


25-6116 


35-9884 


48^1530 


■30 


20423 


5-3239 


10-4142 


17-3130 


26-0203 


36-5362 


48^8604 


■31 


2-1010 


5-4367 


10-6004 


17-6921 


26-4118 


37-0596 


49^5354 


•32 


2-1590 


5-5466 


10-7807 


17-8611 


26-7880 


37-5612 


50^1810 


■33 


2-2153 


5-6523 


10-9526 


18-1164 


27-1435 


38-0341 


50-7880 


■34 


2-2712 


5-7667 


11-1217 


18-3665 


27-4908 


38-4950 


51-3789 


■35 


2-3257 


5-8564 


11-2819 


18-6019 


27-8164 


38-9254 


61-9288 


■36 


2^3788 


5-9537 


11-4370 


18-8287 


28-1289 


39-3374 


52-4543 


■37 


24313 


6-0488 


11-5878 


19-0482 


28-4300 


39-7332 


52-9579 


■38 


2^4822 


6-1397 


11-7304 


19-2543 


28-7113 


40-1015 


53-4248 


■39 


2^5320 


6-2282 


11-8684 


194525 


28-9807 


40-4530 


53-8693 


•40 


2-5804 


6-3128 


11-9988 


19-6386 


29-2319 


40-7792 


54-2800 


■41 


2-6284 


6-3958 


12-1261 


19-8191 


29-4748 


41-0933 


54-6743 


■42 


2-6747 


6-4754 


12-2468 


19-9887 


29-7013 


41-3845 


65-0384 


■43 


2-7198 


6^5516 


12-3609 


20-1479 


29-9124 


41-6545 


55-3741 


■44 


2-7638 


6-6251 


12-4698 


20-2983 


30-1103 


41-9059 


65-6849 


45 


2-8063 


6-6952 


12-5727 


20-4388 


30-2935 


42-1368 


55-9687 


■46 


2-8478 


6-7625 


12-6700 


20-5703 


30-4634 


42-3486 


86-2310 


•47 


2-8878 


6-8264 


12-7610 


20-6915 


30-6180 


42-6403 


56-4585 


■48 


2-9263 


6-8872 


12-8461 


20-8033 


30-7587 


42-7122 


56-6638 


•49 


2-9640 


6-9454 


12-9.263 


20-9069 


30-8870 


42-8667 


56-8459 


•50 


3-0000 


7-0000 


130000 


21-0000 


31-0000 


43-0000 


57-0000 
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p 


x = l 


x = 2 


x = 3 


a;=4 


x=5 


x = 6 


x=l 


■51 


30348 


70516 


130679 


210839 


31^0994 


43^1145 


571291 


■52 


3^0683 


7^1000 


13^1299 


2M581 


311843 


43^2088 


57-2314 


•53 


3-1002 


7^1450 


13^1858 


21-2225 


3b2552 


43^2835 


57-3079 


■54 


31306 


7^1869 


132358 


21-2777 


313122 


433402 


57-3644 


■55 


31596 


7^2250 


132791 


21-3218 


313531 


43-3730 


57-3815 


•56 


3'1S70 


7^2599 


13^3164 


21-3565 


313801 


433873 


57-3781 


•57 


32130 


7^2914 


133473 


21-3809 


313920 


433807 


57-3469 


■58 


32371 


73190 


13^3716 


21-3947 • 


313885 


433529 


57-2880 


•59 


32600 


73432 


13-3893 


21-3981 


313696 


433039 


67-2007 


■60 


3^2804 


7^3630 


13-3992 


21-3890 


313325 


432298 


57-0808 


■61 


33000 


73800 


13-4042 


21-3723 


312845 


43^1406 


56-9409 


•62 


33174 


7-3925 


13-4006 


21-3421 


31-2167 


430245 


56-7654 


•83 


33329 


7-4012 


13-3910 


21-3020 


31-1346 


42^8886 


56-5641 


•64 


33464 


7-4051 


13-3722 


21-2477 


31-0315 


42^7238 


56-3245 


•65 


33581 


7-4052 


13-3469 


211831 


30-9138 


42^5392 


56-0588 


■66 


33676 


7-4011 


13-3143 


211071 


30-7796 


423320 


55-7639 


■67 


33749 


7-3915 


13-2716 


210150 


30-6219 


420921 


55-4258 


■68 


33802 


7-3784 


13-2231 


20^9141 


30-4516 


41-8356 


56-0660 


•69 


3-3830 


7-3595 


131642 


20^7967 


30-2574 


41-5460 


54-6628 


•70 


33835 


7-3357 


130966 


20^6660 


30-0439 


41-2304 


54-2252 


71 


33813 


7-3066 


13-0195 


20^5204 


29-8090 


40-8854 


53-7494 


■72 


33768 


7-2723 


12-9340 


20-3616 


29-5550 


40-6145 


53-2397 


•73 


33689 


7-2307 


12-8370 


201849 


29-2755 


40-1086 


52-6843 


•74 


33586 


7-1853 


12-7301 


199928 


28-9736 


39-6724 


52-0891 


•75 


33450 


7-1328 


12-6124 


197842 


28-6481 


39-2040 , 


61-4521 


•76 


33283 


7-0732 


12-4829 


19-5570 


28-2959 


38-6994 


50-7675 


■77 


33083 


7-0068 


• 12-3413 


193114 


27-9173 


38-1589 


50-0361 


■78 


32839 


6-9324 


12^1858 


190442 


27-5078 


37-5764 


49-2603 


•79 


32563 


6-8506 


120178 


18^7581 


27-0713 


36-9573 


48-4163 


•80 


32243 


6-7603 


11^8352 


184491 


26-6020 


36-2940 


47-5249 


•81 


31875 


6-6603 


U^6360 


18^1148 


26-0965 


36-5816 


46-5695 


•82 


31467 


6-5515 


lh4218 


17^7574 


25-5585 


34-8246 


45-5563 


•83 


3^1007 


6-4330 


111912 


173752 


24-9850 


340200 


44-4814 


•84 


3-0495 


6-3044 


10^9434 


16^9667 


24-3741 


33-1657 


43-3415 


•85 


2-9907 


6-1603 


106694 


16^5178 


23-7056 


32-2328 


42-0996 


•86 


29263 


6-0055 


10-3772 


160415 


22-9985 


31-2483 


40-7906 


•87 


2^8545 


5-8355 


10^0596 


15^5266 


22-2366 


30-1896 


39-3867 


•88 


2^7751 


5-6511 


9^7177 


14^9749 


21-4227 


29-0614 


37-8906 


•89 


2^6859 


5-4471 


9-3428 


14-3731 


20-5379 


27-8373 


36-2716 


•90 


2^5870 


5-2243 


8-9367 


13-7242 


19-5869 


26-5246 


34-5374 


•91 


2^4769 


4-9801 


8-4951 


130219 


18-5605 


25-1109 


32-6731 


•92 


2-3528 


4-7085 


8-0078 


12^2507 


17^4370 


23-5670 


30-6406 


•93 


2-2133 


4-4076 


7-4720 


11^4067 


16^2115 


21-8865 


28-4316 


•94 


2-0554 


40713 


6-8782 


10^4757 


14^8639 


20-0429 


26-0124 


•95 


1-8743 


3-6911 


6-2118 


94362 


133645 


17-9966 


23-3324 


•96 


1-6626 


3-2532 


5-4509 


82559 


1V6682 


15-6877 


20-3147 


•97 


13971 


27122 


4-5211 


68236 


96196 


12-9092 


16-6923 


•98 


1^1107 


21363 


3-5383 


53163 


7^4707 


10-0012 


12-9081 


•99 


•7089 


13451 


2^2065 


32934 


4-5056 


6-0433 


7-8063 
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p 


x= -7 


x=-S 


x= -5 


X=-4: 


x=-3 


x=-2 


x=-l 


x=0 


•01 


5^8579 


4-3203 


3^0080 


2-0210 


1-1595 


■5235 


-1127 


- ^0727 


•02 


97281 


7-1974 


5^0431 


3-2649 


1-8631 


■8376 


•1881 


-•0851 


•03 


12-5809 


9-2914 


6^4954 


4-1930 


2-3841 


1^0686 


•2469 


- ^0813 


•04 


153119 


11-2923 


7-8800 


5-0749 


2-8773 


1-2868 


•3036 


-■0723 


•05 


17^5872 


12-9552 


9-0269 


5-8024 


3-2818 


1-4649 


■3519 


- ^0574 


•06 


196082 


14-4293 


10-0411 


6-4437 


3-6368 


1-6207 


•3954 


- 0392 


■07 


214324 


15-7575 


10-9527 


7-0181 


3-9536 


1-7592 


•4361 


-■0189 


■08 


230983 


16-9684 


11-7820 


7-5391 


4-2398 


1-8841 


•4718 


•O031 


•09 


24-6313 


18-0809 


125423 


8-0155 


4-5005 


1-9973 


•6069 


-0263 


•10 


260376 


19^0998 


132371 


8-4496 


4-7373 


2-0999 


•5376 


•0505 


•11 


27^3456 


20^0459 


138811 


8-8507 


4-9560 


2-1939 


•5673 


-0753 


•12 


28^5668 


20^9282 


144803 


9-2231 


5-1566 


2-2805 


•6963 


-1007 


•13 


29^6947 


21^7416 


150316 


9-5646 


5-3406 


2-3595 


•6216 


-1265 


•14 


30^7546 


22^5049 


155479 


9-8836 


5-5118 


2-4327 


•6463 


•1526 


•16 


317421 


23^2148 


16^0270 


10-1786 


6-6696 


2-4999 


•6697 


■1789 


•16 


326789 


23-8875 


16-4801 


10-4569 


6-8178 


2-5628 


•6921 


•2065 


•17 


33^5392 


24-5036 


16-8942 


10-7102 


6-9520 


2-6196 


•7129 


•2319 


•18 


343503 


25-0840 


17-2831 


10-9474 


6-0772 


2-6723 


•7327 


•2584 


•19 


351151 


25-6302 


17-6483 


11-1696 


6-1938 


2-7211 


•7515 


•2850 


•20 


358361 


26-1442 


17-9912 


11-3773 


6-3024 


2-7666 


•7695 


•3116 


■21 


36^5089 


26-6229 


18-3097 


11-5695 


6-4022 


2-8080 


•7865 


•3381 


■22 


371383 


27-0696 


18-6060 


11-7474 


6-4942 


2-8458 


•8025 


-3643 


■23 


37^7315 


27-4899 


18-8841 


11-9140 


6-5796 


2-8808 


■8179 


-3906 


■24 


38^2837 


27^8802 


19-1413 


12^0670 


6-6676 


2-9126 


■8323 


-4166 


■25 


38^8005 


28^2446 


19-3807 


12^2088 


6^7292 


2-9416 


■8460 


•4425 


■26 


392815 


28^5828 


19-6020 


12-3392 


6^7945 


2-9677 


•8690 


•4682 


•27 


397311 


28^8980 


19-8075 


12-4595 


6^8542 


2-9906 


■8713 


•4938 


•28 


40^1505 


29^1911 


19-9976 


12-5702 


6^9084 


3-0127 


-8830 


•5191 


•29 


40-5354 


294592 


20-1706 


12-6698 


69567 


3-0314 


■8939 


•5441 


•30 


40-8950 


29-7086 


20-3307 


12-7614 


7^0004 


3-0481 


■9043 


•5690 


•31 


41-2256 


29-9368 


20-4762 


12-8435 


70390 


30625 


■9140 


•5935 


•32 


41-5304 


30-1464 


20-6088 


12-9177 


70731 


30760 


■9232 


•6179 


•33 


41-8022 


30-3319 


20-7251 


12-9816 


7^1016 


30849 


■9317 


•6418 


•34 


42-0577 


30-5056 


20-8330 


13-0401 


7^1269 


3-0935 


■9398 


•6667 


•35 


42-2810 


30-6558 


20-9250 


13-0889 


7^1471 


3-1000 


■9473 


•6892 


•36 


42-4819 


30-7896 


21-0057 


13-1303 


7^1632 


3^1046 


-9642 


•7123 


•37 


42-6633 


30-9092 


21-0766 


13-1654 


7^1768 


31076 


-9607 


•7353 


•38 


42-8156 


31-0079 


21-1333 


13-1919 


7^1836 


31086 


•9666 


•7578 


•39 


42-9487 


31-0924 


21-1803 


13-2121 


7^1880 


31080 


■9720 


•7800 


•40 


43-0548 


31-1576 


21-2141 


13-2242 


7^1880 


3-1056 


■9768 


•8018 


•41 


43-1459 


31-2117 


21-2402 


13-2315 


71853 


3-1020 


■9814 


-8235 


•42 


43-2124 


31-2479 


21-2541 


13-2309 


7^1784 


30966 


■9863 


-8447 


•43 


43-2573 


31-2687 


21-2576 


13-2241 


7^1681 


3-0896 


■9888 


-8656 


•44 


43-2815 


31-2743 


21-2507 


13-2107 


7^1542 


30811 


■9918 


-8860 


•45 
•46 


43-2847 


31-2648 


21-2336 


13-1908 


7^1367 


3-0712 


-9943 


•9060 


43-2736 


31-2422 


2b2070 


13-1653 


7^1162 


3-0599 


-9964 


•9257 


•47 


43-2305 


31-2019 


21-1694 


13-1327 


70920 


3-0470 


-9980 


•9449 


■48 


431732 


31-1488 


21-1225 


13-0945 


7^0646 


3-0326 


■9991 


•9636 


•49 


430967 


31-0817 


21-0662 


13-0503 


70339 


3-0170 


■9998 


•9821 


•50 


43^0000 


31^0000 


2b0000 


13-0000 


7^0000 


3-0000 


bOOOO 


10000 
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■p 


x=~l 


a;=-6 


x= -5 


x=-4 


x=-3 


x=-2 


x=~l 


x=0 


•51 


42^8843 


30-9047 


20-9246 


12-9441 


6-9631 


2-9816 


■9998 


10175 


•52 


42-7476 


30-7942 


20-8389 


12-8817 


6-9227 


2-9618 


■9991 


1^0346 


•53 


42-5935 


30-6711 


20-7446 


12-8141 


6-8796 


2-9408 


•9980 


1^0511 


•54 


42-4230 


30-5334 


20-6410 


12^7407 


6-8332 


2-9183 


•9964 


1-0671 


•55 


422251 


30-3818 


20-5271 


12-6610 


6-7835 


2-8946 


•9943 


1-0826 


•56 


42^0115 


30-2161 


20-4041 


12^5757 


6-7308 


2-8696 


•9918 


1-0976 


•57 


4b7777 


30-0357 


20-2712 


12^4843 


6-6749 


2-8430 


•9888 


1-1121 


•58 


415252 


29-8419 


20-1293 


123873 


6-6160 


2-8154 


■9853 


1-1259 


•59 


412511 


29-6327 


19-9770 


12-2841 


6-5537 


2-7862 


■9814 


1-1393 


•60 


40-9540 


29-4070 


19-8135 


12-1738 


6-4876 


2-7554 


■9768 


1-1518 


•61 


40-6451 


29-1728 


19-6445 


12-0603 


6^4202 


2-7242 


■9720 


1-1640 


•62 


40-3102 


28-9201 


19-4631 


11-9393 


6-3486 


2-6909 


■9666 


M754 


•63 


39-9587 


28-6554 


19-2736 


11-8132 


6-2742 


2-6568 


■9607 


1^1861 


■64 


39-5793 


28-3708 


19-0707 


11-6789 


6-1956 


2-6207 


■9542 


11961 


•65 


391834 


28-0744 


18-8600 


11-5401 


6-1145 


2-5836 


■9473 


1^2054 


•66 


38-7691 


27-7650 


18-6406 


11-3959 


6-0307 


2-5455 


■9398 


12139 


•67 


38^3240 


27-4335 


18-4063 


11-2426 


5-9422 


2-5053 


■9317 


12216 


•68 


37^8666 


27-0932 


18-1664 


11-0859 


5-8519 


2-4644 


■9232 


1^2285 


•69 


37^3802 


26-7324 


17-9126 


10-9209 


5-7572 


2-4217 


■9140 


1-2345 


•70 


36-8714 


26-3556 


17^6483 


10-7496 


5-6592 


2-3775 


■9043 


1-2396 


•71 


36-3382 


25-9614 


173724 


10-5712 


5^5577 


2-3318 


■8939 


1-2437 


•72 


35-7841 


25-5525 


17^0868 


10-3870 


5-4530 


2-2851 


■8830 


1-2469 


•73 


35-2007 


25-1226 


16-7871 


10-1943 


5-3440 


2-2373 


■8713 


1-2488 


•74 


34-5925 


24-6752 


16-4760 


9-9948 


52315 


2-1863 


■8590 


1-2498 


■75 


33^9589 


24-2098 


16^1529 


9-7880 


5-1164 


2-1346 


■8460 


12496 


•76 


332959 


23-7236 


15-8161 


9-5732 


49949 


20814 


■8323 


12480 


•77 


32-6046 


23-2175 


15-4661 


9-3506 


4-8705 


2^0264 


■8179 


1^2452 


•78 


31-8801 


22^6878 


15-1006 


9-1184 


4-7414 


19694 


■8025 


1-2407 


■79 


31 1275 


221383 


14-7221 


8-8787 


4-6084 


1-9110 


■7865 


12349 


■80 


30-3409 


21^5648 


14-3278 


8-6297 


4-4706 


1-8506 


•7695 


12274 


■81 


29-5165 


20^9646 


13-9159 


8-3702 


4-3276 


17879 


•7515 


1^2180 


•82 


28-6593 


20-3416 


13-4891 


8-1020 


4-1802 


1-7239 


•7327 


1^2070 


•83 


27-7676 


19-6940 


13-0464 


7-8244 


4-0282 


1-6578 


•7129 


1-1939 


•84 


26-8397 


19-0215 


12-5873 


7-5373 


3^8714 


1-6896 


•6921 


11787 


•85 


25-8521 


18-3066 


12-1004 


7-2336 


3^7062 


1-5183 


•6697 


l^ld05 


•86 


24-8298 


17-5675 


11-5979 


6-9211 


3-5368 


1-4451 


•6463 


11400 


•87 


23-7543 


16-7914 


11-0714 


6-5944 


3-3604 


1-3696 


•6215 


1-1165 


•88 


22-6316 


15-9822 


10-5235 


6-2555 


3-1781 


1-2913 


•5953 


1-0899 


•89 


21-4414 


15-1257 


9-9451 


5-8987 


2-9870 


1-2099 


•6673 


1-0593 


•90 


20-1920 


14-2284 


9-3401 


5-5268 


2-7887 


11265 


•5376 


1-0247 


•91 


18-8761 


13-2849 


8-7055 


5-1379 


2-5821 


10381 


•5059 


■9855 


•92 


17-4739 


12-2814 


8-0324 


4-7269 


2-3650 


-9467 


•4718 


■9405 


•93 


15^9844 


11-2175 


7-3207 


4-2941 


2-1376 


■8512 


•4351 


•8891 


•94 


14^3928 


10-0831 


6-5641 


3-8359 


1-8982 


■7515 


•3954 


•8300 


•95 


12-6762 


8-8626 


5-7529 


3-3470 


1-6448 


■6465 


•3519 


•7612 


•96 


10-8011 


7-5333 


4-8728 


2-8195 


1-3737 


-5350 


•3036 


•6795 


•97 


8-6423 


6-0092 


3-8696 


2-2236 


1-0711 


-4122 


•2469 


•5751 


•98 


6-4503 


4-4660 


2^8579 


1-6261 


-7705 


-2913 


•1881 


•4613 


•99 


3-6335 


2-4665 


15250 


-9088 


-4181 


-1527 


•1127 


•2981 
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A LIST OF DEFINITE INTEGRALS OF FREQUENT 
OCCURRENCE IN PROBABILITY WORK 



zero. 





i-OO /-OO 

JO ^ J -oo 



c= r z^e-^'dz^^ . r = 

Jo * ,/ -00 



■y/iT 



z^e~'^dz= ^, I = zero. 

JO J -00 

["^ .= j 1.3.5... (n- 1)^7^. , 

= j^ ^e- dz = 2.2.2... 2 ^ (** "^"") 



'o 



°^ 2.2.2. 2...2 ('^ °'^*^)- 
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-/I 
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but is not ^ . 
H=r e-<^+^^^+«^»>d2 = ^ e-^^ , but f " is not f . 

J -or, VV -'O ^ 

iiPQ> R\ but f f is wo« ^ . 

Z = e ^ zdz = \- e Q , but is not -^■ 
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J-coJ-« 2(PQ-R^f 

/■OO /•OO T 

but I is not J . 

AT = f" r -(P^+^mt+QV) ^2d^ ^ ^ ^ ^ ,iiPQ> n\ 

J-o.J-00 ^ 2iPQ-R^f 



Note. In the above integrals P and Q do not, as often, represent 
constants whose sum is unity. R is not a correlation coefficient, although 
it is closely connected therewith. 



APPENDIX III 

NOTATION 

A simple, consistent, and easily remembered notation is an essential 
in any mathematical treatment. The chief difficulty in the way of 
obtaining consistency in a work like the present is the desire naturally 
felt to keep any piece of research in the notation used by its originator. 
But a great effort has been made to attain a notation which could be 
used throughout the book without too seriously altering the appearance 
of any well-known formula. Most of the symbols in the following 
Table have been used by several important writers with the meaning 
here indicated. They are either always used with this meaning: or if 
there be any exception it is hoped that the context will make clear what 
is meant. 

The greatest difficulty has been felt over the choice of a symbol 
for a stimulus value. The commonest symbols hitherto used have been 
R and S for stimulus and sensation intensity respectively, and r has 
been used instead of R by at least one very important writer. But 
there is still greater unanimity of usage in employing r and R for correla- 
tion coefficients. In adopting small s for stimulus, we realise that there 
is some danger of confusion with great S used in the past, by writers 
on Weber's Law, for sensation intensity. But some of the same writers 
also use S for a standard stimulus (as we do) : and after the first chapter 
we have no necessity to speak of sensation-intensities at all, it is frankly 
physical stimuli which are measured. In the first chapter itself ambiguity 
or misunderstanding is avoided by printing the words at full length. 

It is hoped that this list of symbols may assist in obtaining unanimity 
and consistency in the notation of the subject and serve as a convenient 
reference. 
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Symbols Used 

Symbol Meaning 

a a mean, m is also used for a mean, and any quantity with a bar, as x, is a mean. 

A a true value. 

Oj, Oj, a, readings of which a is the mean. 

b a coefficient of legreasion. Usually suffixes are added. 

c and C constants, k is also used in this sense. 

Ci, Cj, and contingency symbols. 

5 a deviation: also used for a finite increment, d indicating an infinitesimal 

increment, 
e an error. 

h modulus of precision in one notation of the Gaussian or Normal Curve of Error. 

k a constant. Also c and C. See also n. 

I see n. 

m a mean. See also n. 

M MiiUer's weights. See w. 

fi and V moments. 
N see n. 

n a number (of experiments, or the like). Where necessary, N, and also h, I, to, 

are used in this sense, m more commonly is used for a mean. 
p a mathematical measure of probability. 

P same as p. 

Q same as p. Usually Q is I -Pi also used, usually with a suffix, for a quartile. 

q same as p. Usually qia 1 -p. 

r a correlation coefficient. 

B a correlation coefficient. 

71 a correlation ratio. 

s a stimulus value. Sis a, standard stimulus, F a variable, s is also on one occasion 

used for a standard deviation. 
<r a standard deviation. Also Z is used, and on one occasion s. 

S a standard deviation. Would also be used for a summation of any kind other 

than that for which S is used in this book; but no such occasion arises in 

the present work. 
S a sum over the distribution, e.g. iS (e) is the sum of the errors. S is also used for 

a standard stimulus, see small s and also the remarks preceding this table. 
t t, with X, y and z, is used as a variable of integration. 

T Thresholds, both absolute and difference. Sometimes with suffixes as T„ and 

Ti for upper and lower difference thresholds or hmina. 
u u and v "'residuals" in observation equations. 

V see u. 

V variable stimulus, see s. Also coefficient of variation. 

w "weight" of a reading or an equation. Also W, and M, the latter for MiiUer's 

weights only. 
W see w. 

X in connection with goodness of fit. 

X x,y,z and t are used as variables of integration. 

« see X. 
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